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PREFACE 


I wrote this book for myself. 

I wanted to piece together carefully my own path 
through Galois Theory, a subject whose mathematical 
centrality and beauty I had often glimpsed, but one which 
I had never properly organized in my own mind. I wanted 
to start with simple, interesting questions and solve them 
as quickly and directly as possible. If related interesting 
questions arose along the way, I would deal with them 
too, but only if they seemed irresistible. I wanted to ijvoid 
generality for its own sake, and, as far as practicable 7 , even 
generality that could only be appreciated in retrospect. 
Thus, I approached this project as an inquirer rather than 
as an expert, and I hope to share some of the sense of 
discovery and excitement I experienced. There is great 
mathematics here. 

In particular, the book presents an exposition of those 
portions of classical field theory which are encountered in 
the solution of the famous geometric construction 
problems of antiquity and the problem of solving 
polynomial equations by radicals. Some time ago much of 
this material was covered in undergraduate courses in the 
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‘theory of equations’. Paradoxically, as the theory 
matured and became more elegant, it also moved higher 
into the curriculum, so that nowadays it is not uncommon 
for it first to be encountered on the graduate level. It 
seems to me that this is most unfortunate, for this 
important and beautiful area of mathematics deserves to 
be studied early. It can then lend perspective and 
motivation to the later abstract study of mathematical 
structures. 

Simple mathematical questions and problems can be 
fascinating; theorems can often appear quite boring — 
unless they help us to deal with problems we are already 
thinking about. In this spirit, I have tried to develop the 
subject along the lines of various natural questions, 
usually stated at the beginnings of sections. Some of the 
mathematical gems encountered along the way relate to: 
angle trisection, duplication of the cube, squaring the 
circle, the transcendence of e and of ir, the construction of 
regular polygons, the irreducibility of the nth cyclotomic 
polynomial, the solution of polynomial equations by 
radicals, Hilbert’s irreducibility theorem, polynomials over 
the rationals with symmetric groups, the number of 
automorphisms of the field of complex numbers, and 
more. 

This book differs from the standard textbooks in several 
respects. First, the prerequisites are quite modest: basic 
calculus, linear algebra (including matrices, determinants, 
eigenvalues, and vector spaces), and the ability to follow 
precise mathematical arguments. No previous knowledge 
of groups, fields, or abstract algebra is needed. Second, I 
work entirely with subfields of the field of complex 
numbers. The richness of the subject is almost fully 
present here, and the concrete setting makes it easier to 
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grapple with and appreciate difficult material. Third, 
milestone developments appear almost immediately, in 
the first chapter, before we start to build up much theory. 
This provides momentum, and it also underlines the way 
the later theory represents the development of more 
powerful methods. Indeed, a number of problems are 
solved in several ways as we proceed. Fourth, the 
mathematics is ‘interdisciplinary’. I have the luxury of not 
needing to follow a certain syllabus in abstract algebra, 
say, and so I develop and use whatever I need to solve a 
problem. Thus honest use is made of calculus, linear 
algebra, number theory, and algebraic geometry, for 
example, although without exceeding the prerequisites for 
the book. Fifth, there are reasonably complete solutions to 
all 176 problems. Too often have I turned to reference 
books only to find my own question given as a problem; it 
is often impractical to take the time to work such things 
out. Sixth, each chapter closes with an annotated 
bibliography, which should enable the interested reader to 
get quickly to the original sources or to supplementary 
material. In all, I hope this book may serve a wide 
audience: bright undergraduates, even at the sophomore- 
junior level, where I used the first three chapters for a 
one-semester course, prospective and current high school 
teachers, non-mathematicians with an interest in 
mathematics, as well as, yes, graduate students and 
professional mathematicians, some of whom have already 
welcomed the manuscript in the spirit of its unofficial 
title: “What You Always Wanted to Know about Galois 
Theory (but were afraid to ask)”. 

In order to maintain a sense of continuity, the 
thirty-seven theorems are numbered consecutively 
throughout the text. Lemmas and corollaries assume the 
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number of the theorem they immediately precede or 
follow, respectively. There are certain instances where I 
use standard notions from calculus even though it is 
possible to work with formally identical counterparts that 
do not depend on calculus. For example, I treat 
polynomials as functions and their derivatives as these 
occur in calculus. I do not treat abstract groups or rings. 
It is true that in a few instances such concepts could be 
used to effect economies or more elegant arguments. It is 
also true that the similarities in a number of calculations 
point strongly and clearly at fundamental abstract 
notions. My opinion is that the basic ideas and techniques 
used are already so difficult for the uninitiated that 
further definitions, axioms, and abstractions would tend 
to obfuscate rather than clarify. (Such abstractions are 
occasionally pointed to, however.) 

I gratefully acknowledge the support and encourage- 
ment of the members of the Subcommittee on Carus 
Monographs during the preparation of this volume: 
Daniel Finkbeiner, George Piranian, and Ralph Boas. 
Both Daniel Finkbeiner and Wendell Lindstrom labored 
through the entire text and made numerous valuable 
suggestions. John Fay critically read parts of the manu- 
script, and Victor Albis, with mathematical and bib- 
liographical assistance, rescued me from despair during 
the writing of the last chapter. Support received during a 
year at Bowdoin College, especially the availability of the 
outstanding Smyth Mathematical Library, contributed 
significantly to this project. I apologize for any errors that 
may be found. 

I now take more seriously other authors’ expressions of 
gratitude to spouses and children for their sacrifices and 
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long suffering during such writing projects. To my wife 
Joanne and to my children, I express my thanks for such 
support and such endurance. If there will be readers who 
through this book find the same excitement and 
enjoyment in the material as I did, then I think it will all 
have been worth it. 


Charles R. Hadlock 


Cambridge, Massachusetts 
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INTRODUCTION 


Mathematical knowledge has undergone explosive 
growth in the past few hundred years. No doubt, a 
significant portion of this growth is due to the usefulness 
of mathematics as a tool in many fields. However, 
probably the greatest single stimulus to the development 
of this subject is its natural intellectual appeal — the 
challenge of a hard problem and the satisfaction of 
resolving it. For centuries these factors have motivated the 
efforts of mathematicians. 

Problems which are simple to state and understand are 
quite reasonably often the most attractive. It is not 
uncommon for such problems to stimulate extensive 
research by many mathematicians, and through their 
solution to lead to results and insights far beyond their 
original bounds. The solution of one problem inevitably 
leads to the posing of others, and thus the subject of 
mathematics continues to grow. 

The theories emerging from the solution of fundamental 
problems continue to be refined and generalized, for part 
of the work of the mathematician is to abstract from one 
situation its fundamental features and to apply these to 
other situations that are similar. A mature theory may 
bear scant resemblance to the solution of the original 

1 
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problems upon which it is founded. And the abstract 
exposition of such a theory may not stimulate in those 
who would study it the intellectual excitement and 
curiosity which led to its development. 

What are some of these early problems that have given 
birth to great theories? In this book, we shall be 
concerned with the following: 


A. The Three Greek Problems 

Most people have learned in school how to perform 
certain geometric constructions by means of a compass 
and an unmarked straightedge. For example, it is quite 
simple to bisect an angle, to construct the perpendicular 
bisector of a line segment, or to construct the line through 
a given point parallel to a given line. With the same 
instruments and rules of construction, is it possible to 

(1) double a cube, meaning to construct the edge of a 
cube whose volume would be twice that of a cube with a 
given segment as an edge; 

(2) trisect an angle, meaning to divide an arbitrary angle 
into three equal parts; or 

(3) square a circle, meaning to construct a square having 
the same area as a given circle? 

These problems date from the fifth century b.c., and 
they have a long and interesting history. For example, 
legend has it that ancient Athens, being faced by a serious 
plague, sent a delegation to the oracle of Apollo at Delos 
for advice in their difficulty. The delegation was told to 
double the cubical altar to Apollo. Unfortunately, they 
doubled the length of each edge, thereby increasing the 
volume by a factor of eight rather than two; and the 
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plague only got worse. From this story, the problem of 
doubling a cube has received the name “the Delian 
problem”. For detailed information on the history of these 
problems, the reader may wish to consult a book on the 
history of mathematics, several of which are listed later in 
this section. 

With respect to Problems 1 and 3, the construction is 
never possible. With respect to Problem 2, there exist 
angles that cannot be trisected, among them the angle of 
60°. One should pay careful attention to the nature of 
these results. They do not simply assert that no 
construction is known, that none has been discovered; 
they actually say that none can exist, which is a much 
stronger statement. It is natural to wonder how such a 
negative result might be established. It will not be 
necessary to wait long, as these problems are solved in the 
first chapter. 

Historically, however, it took about 2200 years before 
solutions were discovered, despite frequent periods of 
frantic effort. It is hard to say just when Problem 1 was 
first solved. A solution to Problem 2 came as a special 
case of more general results by Carl Friedrich Gauss 
(1777-1855) and Pierre Wantzel (1814-1848) on the 
problem of finding those values of n for which it is 
possible to construct a regular polygon with n sides. Gauss 
made substantial progress on this problem by the age of 
eighteen! (We shall take up this latter problem in Chapter 
2.) Problem 3 was solved by F. Lindemann in 1882 [Uber 
die Zahl m, Math. Ann., 20 (1882), 213-225]; it is 
somewhat more difficult than the first two. For a 
fascinating history of many false solutions to this problem 
(including a variety of methods — mathematical, religious, 
etc.), it is well worth searching out a copy of the book by 
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F. J. Duarte, Monografia sobre los Numeros it y e [Estados 
Unidos de Venezuela, Bol. Acad. Ci. Fis. Mat. Nat., 11 
(1948), no. 34-35, 1-252 (1949); Math. Reviews, 11-501], 


B. The Solution of Polynomial Equations by Radicals 

The reader is probably familiar with the ‘quadratic 
formula’, which asserts that the solutions to the quadratic 
equation 

ax 2 + bx + c = 0 

are given by 


— b± \fe 2 — 4ac 
X 2a 

Can a similar formula be found for the general poly- 
nomial equation of degree 3: 

aj c 3 + bx 2 + cx + d = 0 ? 

How about polynomials of degree 4, 5, or, in general, «? 
By a ‘similar formula’, let us agree to mean some definite 
procedure for calculating the solutions from the 
coefficients of the polynomials, using the rational 
operations (addition, subtraction, multiplication, and 
division) and the extraction of roots. Solutions obtained 
using these operations are called ‘solutions by radicals’. 

It turns out that for polynomials of degree less than 
or equal to 4, such a procedure exists, whereas for 
polynomials of higher degree, no such general procedure 
exists. Indeed, there are polynomials of every degree 
greater than or equal to 5 with roots which cannot be 
obtained from the coefficients by any sequence of rational 
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operations and the extraction of roots. Again, this is a 
very strong negative result. 

The formula for the quadratic equation seems first to 
have been discovered by the Arabs around 900 a.d., 
although quadratic equations had been solved by the 
Babylonians 3000 years earlier. The general cubic 
equation was probably first solved by Scipione del Ferro 
(1465-1526) and the quartic equation by Ludovico Ferrari 
(1522-1565). For the next few centuries, considerable 
efforts were directed towards the quintic equation, with 
the expectation that a general formula would eventually 
be found. However, towards the end of the eighteenth 
century, a number of mathematicians began to suspect 
that perhaps it was not possible to obtain a formula for 
the solution of the general quintic equation. The work of 
Joseph Lagrange (1736-1813) had considerable effect on 
this change of attitude. Beginning in 1799, Paolo Ruff ini 
(1765-1833) published several somewhat deficient proofs 
of this negative result. Finally, Niels Henrik Abel 
(1802-1829) produced in 1828 an essentially correct proof 
of the unsolvability of the general quintic. Soon thereafter, 
at the age of nineteen, Evariste Galois (1811-1832) 
perfected the theory by giving a simple necessary and 
sufficient condition for the solvability by radicals of a 
polynomial equation. The theorems of Abel and Galois 
are developed in Chapter 3 of this book. 

While it is far from apparent in their statement, the 
solutions of Problems A and B use many of the same 
ideas and techniques. These ideas have led to the modern 
theories of groups and fields. Rather than focus on the 
abstract theories in this book, we shall simply imagine at 
the outset that we face these problems, and as we proceed 
we shall develop the ideas necessary for their solution. It 
is hoped that the concreteness of this approach will serve 
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to invite the readers to ask their own questions, to wonder 
freely, and to pursue matters of individual interest along 
the way. At the end, a powerful theory will have been 
developed, and to convert it to its usual abstract 
formulation is then a relatively small and easy step. For 
this abstract formulation, the reader may wish to consult 
textbooks on abstract algebra. 

The lines of the development here are not, in general, 
those of the original proofs; rather, they are heavily 
influenced by modem approaches and terminology. The 
reader may be interested to take a look at some of the 
original papers on this material, especially the works of 
Abel, Galois, and Gauss. The references are: 

Niels Henrik Abel, Oeuvres completes, 2 vols., ed. by L. 
Sylow and S. Lie, Grondahl and Sons, Christiana, 1881. 

Evariste Galois, Ecrits et Memoires Mathematiques, ed. 
by R. Bourgne and J.-P. Azra, Gauthier-Villars, Paris, 
1962. 

Carl Friedrich Gauss, Werke, 9 vols., Konigliche 
Gesellschaft der Wissenschaften, Gottingen, 1863-1906. 

, Disquisitiones Arithmeticae, tr. by A. Clarke, 

Yale University Press, New Haven, 1966. 

While it is easy to pass such references by, even a cursory 
examination of the original works lends a certain valuable 
historical perspective to the mathematical developments. 
(And the scrap papers and doodlings of these famous 
mathematicians are sometimes the greatest contributors in 
this respect. There are some samples of Galois’ in his 
collected works; for Abel, photostats are available in the 
biography by Oysten Ore [Univ. of Minnesota Press, 
1957].) An extensive paper tracing the historical evolution 
of the theory into its modem form has been written by B. 
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Melvin Kiernan [The Development of Galois Theory from 
Lagrange to Artin, Arch. History Exact Sci., 8 (1971-72), 
40-154]. It may also be interesting to read concurrently 
some material on the history of mathematics. Good 
sources are: 

E. T. Bell, Men of Mathematics, Simon and Schuster, 
New York, 1937. 

Carl Boyer, A History of Mathematics, Wiley, New 
York, 1968. 

Morris Kline, Mathematical Thought from Ancient to 
Modern Times, Oxford University Press, New York, 1972. 

David Eugene Smith, History of Mathematics, vols. 1 
and 2, Dover, New York, 1958. 

One cannot help but be awed by the magnificent 
accomplishments of Abel and Galois, especially during 
their short and tragic lives, the former having died at age 
twenty-nine and the latter having been killed in a foolish 
duel before the age of twenty-one! 

Before ending the Introduction, it is necessary to say a 
word about the problem sets at the end of each section in 
the main text. Mathematics is not a “spectator sport”; it is 
necessary to read mathematics with paper and pencil at 
hand. The problems at the end of each section are 
extremely important. One should read through all the 
problems, for many of them are referred to later on. A few 
minutes should be spent on each, enough to think through 
the solution in cases when the problem is very simple, or 
to plan a possible attack in more difficult cases. Then, 
those problems that are appealing to the reader should be 
tried in their entirety. Particularly challenging problems 
are marked with an asterisk. Lastly, for any problems that 
the reader has thought about but has not been inclined or 
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able to give a complete solution, it may be helpful to 
consult the solutions given in the back of the book. 
But — and this is important — it is largely unproductive to 
read solutions before at least some thought has been given 
to the problems. 

It should be kept in mind that the persons who first 
solved the problems in this book probably did so largely 
because of the sheer intellectual appeal and challenge of 
the problems; they were fun. It is hoped that the reader 
will find similar attractions in this material. 



CHAPTER 1 


THE THREE GREEK PROBLEMS 


Section 1.1. Constructive Lengths 

An important step in the solution of the classical Greek 
construction problems will be taken in this section, where 
we shall answer the question: Given a line segment of 
length 1 in the plane, for what values of a can we construct 
a segment of length a? As usual in geometry, we shall 
always consider lengths to be non-negative. To begin, 
naturally, we need to take a careful look at the rules 
which govern the construction, where by “construction” 
we refer to a finite sequence of operations with a compass 
and an unmarked straightedge. The individual operations 
which may be done with these instruments are called the 
fundamental constructions', they are: 

(1) Given 2 points, we may draw a line through them, 
extending it indefinitely in each direction. 

(2) Given 2 points, we may draw the line segment 
connecting them. 

(3) Given a point and a line segment, we may draw a 
circle with center at the point and radius equal to the 
length of the line segment. 

It is tempting to add one additional operation, namely: 

Given a point, we may draw a circle with center at that 

9 
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.P 



A circle (or an arc thereof) is drawn with center P and any radius greater 
than the perpendicular distance from P to the line. 

Figure 1 


point and indefinite radius, possibly restricted by some 
inequality. 

The reader may recall such an operation, for example, in 
the first step in constructing the line perpendicular to a 
given line and through a given point P not on that line, as 
in Figure 1. However, it will follow easily from Lemma la 
that, given a segment of unit length, all segments of 
rational lengths are constructible. Hence, within any 
restricted interval of lengths, we can always find one of 
the type prescribed in operation 3. Thus it is without loss 
of generality that the fourth operation is omitted. 

Using the simple constructions of high school geometry, 
it is possible to show: 

Lemma la. Given segments of lengths 1, a, and b, it is 
possible to construct segments of lengths a + b, a - b ( when 
a > b), ab, and a/ b ( when b # 0). 

Proof. The first two should be obvious. To obtain the 
product ab, first construct two non-collinear rays 
emanating from a single point; for example, construct two 
perpendicular rays. (You may follow this construction on 
the first triangle in Figure 2.) It should be clear that this 
can be accomplished beginning only with a single (unit) 
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segment. On these rays, mark off segments of lengths a 
and 1 respectively, and connect the resulting points to 
form a triangle. On the ray on which the length 1 has been 
marked off, now mark off a length b also from the end. If 
a line is now constructed through this point parallel to the 
third side of the triangle constructed earlier, it intersects 
the other ray at a distance ab from the endpoint of the 
ray, as follows at once by the similarity of the triangles. 
The second triangle in Figure 2 pertains to the analogous 
argument for the quotient a/b, for which the reader 
should think through the steps. ■ 

Let us agree to call a real number, a, constructible if, 
given initially a segment of length 1, it is possible to 
construct a segment of length |a|. By consideration of 
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various cases for the signs and relative sizes of numbers a 
and b, it is an immediate consequence of Lemma la that 
if a and b are construe tible, then so too are a + b, a — b, 
ab, and, when b ^ 0, a/b. (Cf. Problem 1.) From this it 
follows that all the rational numbers are constructible, for 
any integer can be formed by adding or subtracting an 
appropriate number of l’s, and so then any quotient of 
integers can be obtained. 

We shall find it very convenient in our work to study 
certain special sets of numbers, called fields. Let F be a 
subset of the set R of real numbers. We say that F is a 
field if it satisfies the following two conditions: 

(1) F is closed under the rational operations (addition, 
subtraction, multiplication, and division except by 0), 
meaning that whenever these operations are applied to 
elements of F, the result is an element of F. 

(2) The number 1 is an element of F. 

(The second condition simply excludes the two trivial 
cases F = 0 and F = {0}.) It is obvious that the rational 
numbers Q and the real numbers R are fields; and. by 
Lemma la, so too is the set of constructible numbers. 
Many other fields will soon appear. (Later on, the 
definition of field will be generalized to include sets of 
numbers which are not necessarily subsets of R.) The set 
Z of all integers is an example of a familiar set which is 
not a field, since it is not closed under division. 

In our search for some description of those numbers 
which are constructible, it is important to note that it may 
be possible, using only lengths in some field F, to 
construct a length not in F. In particular, this will be the 
case whenever F is not closed under the operation of 
taking the square root of a positive number, as we now 
show. 
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Figure 3 


Lemma lb. Given segments of lengths 1 and a, a 
segment of length fa may be constructed. 

Proof. Figure 3 indicates one such construction. 
Adjacent segments of lengths a and 1 are marked off on a 
line. The midpoint of this combined segment is then 
constructed, and from this the circle having the segment 
for a diameter. A perpendicular constructed from the 
point where the original two segments were joined 
intersects the circle at a distance fa from the diameter, as 
follows from the similarity of the two small triangles in 
the diagram. ■ 

Thus we are able to deduce from this and the previous 
lemma that if all the numbers of a field F are construct- 
ible, then so too are all the numbers of the form a + bfk , 
where a, b, and k are in F and k > 0. What is interesting 
is that for a fixed element k £ F, k > 0, the set of all 
numbers of the form a + bfk itself forms a field! It is 
called the extension ofFbyfk and is denoted F(VF ). Of 
course, if fk is in F, this set is simply F itself. But if fk is 
not in F, this new set contains F as a proper subset. In 
this case F (fk ) is called a quadratic extension of F. 

Lemma lc. If F is a field and if k e F, k > 0, then 
F (fk ) is also a field. 
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Proof. Since IGF, 1 G F (fk ). All we need to show 
now is that the sum, difference, product, and quotient of 
numbers in F (fk ) are themselves in F(fk ), that is, that 
they have the form A + B-fk with A, B G F. We have: 

(a + bfk ) ± (c + dfk ) = (a ± c) + (b ± d)fk , 

(a + bfk )(c + dfk ) = (ac + bdk) + (ad + bc)fk , 

a + bfk _ a + bfk c - dfk 

c + dfk c + dfk c - dfk 


1 ac - bdk \ 

I be- ad \ 

V c 2 -d 2 k I 

l c 2 - d 2 k ) 


Of course, with respect to the division part, we must be 
careful that our denominators are not zero. By assump- 
tion, c + cbfk # 0. If it happens to be the case that 
c — dfk = 0, it follows that c + dfk — 0 = c + dfk — 
(c - dfk ) = 2 cbfk i=- 0, and so d ^ 0. But then we would 
have fk = c/d G F, and so the original quotient would 
be in F and hence F (fk ) by inspection. In any case, we 
see that the result of each of the arithmetic operations is a 
number of the required form. ■ 

We know now that every rational number is construct- 
ible and that every number that can be calculated from 
the rationals by a finite sequence of the rational 
operations and the square root operation is 
also constructible. For example, the number 

+ y 1 + l/T is constructible, as we can build it 
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up by the following sequence: 

7, V7 , yfl +V7 = 2VT , 1, 1 + 2/f , }fl + l/f , 
6, V6 , V6 +Vl +2V7 , ^ +Vl +2V7 , f , 
+ Vl + 2V7 , 13, Vl3 , = W , 

far + f\/v § ’ + Vi + 2 ^ . 


This operation may be conveniently restated in terms of 
fields: 

Lemma Id. /I number a is const rue title if there is a 
finite sequence of fields Q = F 0 C F, C • • • C F^, with a 
E ¥ n , and such that for each j, 0 < j < N - \, F J+l is a 
quadratic extension of F f 

Proof. By induction on N. If N = 0, a is rational and 
hence constructible. Now we assume the theorem is true 
for N = R and we prove it for N = R + 1. If a E F R+1 , 
then a may be expressed as a R + b R ^k R , where a R , b R , 
and k R all belong to F*. By the inductive hypothesis, a R , 
b R , and k R are all constructible; and so, by Lemmas lb 
and la, a = a R + b R y[k^ is constructible. ■ 

As an example, one such sequen ce of fields for the 
number t/TT + § yVfT + yl+2VT , which was treated 
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just before the lemma, is: 

F 0 = Q, F, = F 0 (V7 ), F 2 = F,(Vl + W ), 

F 3 = F 2 (V6), F 4 = F 3 (VV6 +0 +2V7 ), 

F 5 -F 4 (Vl3), F 6 = F 5 (V/rT), 

which is a field containing the number. Such a sequence is 
simply determined by following through the steps of the 
construction and extending the field each time a square 
root not already in the field is introduced. If one were 
actually interested in proving that one had a sequence of 
fields precisely as described in the lemma, namely a 
sequence of quadratic extensions, it might take some work 
to determine at which steps a truly larger field is 
produced. For example, could you show in the case above 
that Vl3 ^F 4 ? For our purposes, such calculations will 
not be important. Incidentally, there are many different 
paths that one could follow in building up the 
complicated number above, or any others. Different paths 
may give different sequences of fields. (See Problem 9.) 

Lemma Id characterizes a certain class of constructible 
numbers. Are there any other constructible numbers? The 
answer is “no”; in order to see this, we shall make use of 
some elementary analytic geometry. 

If F is a field, the plane of F will denote the set of all 
points (x,y) in the Cartesian plane such that both x and y 
are in F. By a line in F is meant a line passing through two 
points in the plane of F. By a circle in F is meant a circle 
with both its center and some point on its circumference 
in the plane of F. The importance of these notions is the 
fact that apy fundamental construction using only points 
in the plane of a field F involves the construction of (a 
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portion of) a line or a circle in F. For the first two 
fundamental constructions this is obvious by the 
definition of a line in F. For the third fundamental 
construction, that of a circle with a given center and a 
radius equal to the length of the segment connecting two 
given points, suppose that the center is (x„ y x ) and the 
segment is that connecting (x 2 , y 2 ) and (x 3 , y 3 ), where all 
the coordinates are in F, as in Figure 4. It is now easy to 
see that the desired circle is in the plane of F, for its center 
is in the plane of F and it passes through (x, + x 3 - x 2 , 
y x + y 3 - y 2 ), which is also in the plane of F. 

We find further that lines and circles in F can be 
described by simple equations, all of whose coefficients 
are themselves in F, as stated below: 

Lemma le. Every line in F can be represented by an 
equation of the form ax + by + c = 0, where a, b, and c are 
all in F. Every circle in F can be represented by an equation 
of the form x 2 + y 2 + ax + by + c = 0, where a, b, and c 
are all in F. 

Proof. In the case of the line, let (x,,/,) and (x 2 ,_y 2 ) be 
two points on it and in the plane of F. If x, = x 2 , then of 
course the line is described by the equation x = x„ which 
has the desired form (a = 1, b = 0, c = - x,). If x, ^ x 2 , 
then the equality of slopes, 

y-y\ = y2~y t 

x - x x x 2 — X, 


yields the equation 

{y 2 -yd x + (*i - *2 )y + (/i*2 - *i j 2 ) - o. 

which is also of the desired form. 
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In the case of the circle, denoting the center by (x lv y,) 
and a point on it and in the plane of F by ( x 2 ,y f), the 
equation is 

(x - x,) 2 + (y-yi) 2 = ( * 2 “ *.) 2 + (y 2 ~y\f- 
This clearly reduces to the desired form. ■ 

Since the only way to construct a segment is by 
constructing its endpoints, and since the only way to 
construct a point is as the intersection of two lines, a line 
and a circle, or two circles, it is important to determine 
the nature of such intersections. In particular, we have: 

Lemma If. The point of intersection of two distinct , 
nonparallel lines in F is in the plane of F. The points of 
intersection of a line in F and a circle in F, as well as the 
points of intersection of two distinct circles in F, are either in 
the plane of F or in the plane of some quadratic extension 
of F. 

(Note that the plane of F is contained in the plane of any 
F (V*), so that the words “either in the plane of F or” 
could be omitted without changing the meaning of the 
lemma. It is more convenient to think of it in the form 
given, however.) 

Proof. The case of the intersection of two lines in F 
amounts to the simultaneous solution of the two equations 

a l x + b l y + c = 0 

a 2 x + b 2 y + c = 0 

where the coefficients are, by Lemma le, all in F. It is 
obvious that this only involves rational operations, and so 
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the solutions x and y are in F, which means that (x,y) is in 
the plane of F. 

The case of a line in F and a circle in F amounts to the 
simultaneous solution of the two equations 

a x x + b x y + c, = 0 

x 2 + y 2 + a 2 x + b 2 y + c 2 = 0 

where the coefficients are all in F. Since a x and b { cannot 
both be 0, the first equation can be used to solve for one 
of the variables in terms of the other. Without loss of 
generality we assume that we can solve for 


-c, a x 



When this is substituted into the second equation, we 
obtain a quadratic equation in x with coefficients in F. 
From the quadratic formula, its solution, which we need 
not calculate explicitly, has the form A ± B\[k , with A, B, 
and k in F and k > 0 (otherwise the line doesn’t intersect 
the circle). Substituting this in the equation for y also 
yields an expression of the form A' ± B'y[k . Thus the 
resulting points are either both in the plane of F, as when 
yfk GF, or both in the plane of F(Jk ), Jk $F. (When 

k = 0, of course, we only have one point; and it is in the 

plane of F.) 

For the case of two circles, the equations 

x 2 + y 2 + a x x + b x y + c, = 0 

x 2 + y 2 + a 2 x + b 2 y + c 2 = 0 

can be subtracted, yielding a linear equation, with 
coefficients in F, to be solved simultaneously with the 
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equation of either of the circles. Thus this reduces to the 
situation in the previous case. ■ 

We have finally reached the point where it is possible to 
state and prove the main result concerning constructibil- 
ity, thus answering the question posed in the first sentence 
of this section. 

Theorem 1. The following two statements are equiva- 
lent: 

(i) The number a is constructible . 

(ii) There exists a finite sequence of fields Q = F 0 c 
F, C • ■ • C F^, with a G F N , and such that for each j, 
0 < j < N — \, F, + , is a quadratic extension of F f 

Proof, (ii) ==> (i). This is simply a restatement of 
Lemma Id. 

(i)=»(ii). If we are given the points (0, 0) and (1, 0) in 
the Cartesian plane, then by hypothesis a segment of 
length |a| can be constructed. It is clear that we may use 
this segment, the point (0, 0), and the line through (0, 0) 
and (1, 0), to construct the point (a, 0), which we shall call 
P. Thus it suffices to show that P is in the plane of a field 
F n of the type described in (ii), that is, a field obtainable 
from Q by a sequence of quadratic extensions. 

Now, the construction of P involves a finite number of 
the fundamental constructions, each of which results in a 
finite number of new points as intersections of various 
types. We list all such points in order of construction, 
where points resulting on the same step may be listed in 
any order. The point P is on this list, say in the A/th 
position. Omitting anything in the list after P (which 
could only include points constructed on the same step as 
P), we have: Pn / > 2> • • • ’ Pm- i> Pm- The theorem will be 
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proved once we have established the following claim: 
There exists a field F, obtainable from Q by a sequence of 
quadratic extensions, such that P\, Pi, ■ • • , P M are all in 
the plane of F. Since P j and P 2 must be the two given 
points (0, 0) and (1,0), which are in the plane of Q, the 
claim holds for M = 1 and 2. To show that it holds for 
any M , we recall that the construction of Pm only involves 
figures constructed using (some subset of) the points P t 
through Pm - .. and hence, by the inductive hypothesis, 
figures in the plane of some field F obtainable from Q by 
a sequence of quadratic extensions. But then, by Lemma 
If, P M is in the plane of either F or F(V& ), for some k 
eF with VF ^F. In either case, Pm > as well as the pre- 
vious P;s, are all in the plane of a field of the required 
type. a 

The reader should take note of the fact that this 
theorem sets up an equivalence between two properties, 
one of which is geometric and one of which is algebraic. It 
is the conversion of our classical geometric problems to 
algebraic problems, via this theorem, that provides the key 
to their solution. This will be shown in the next few 
sections. 


PROBLEMS 

1. Use Lemma la to show that if a and b are 
constructive, then so too are a + b, a - b, ab, and, when 
b 0, a/ b. 

2. If M and N are positive integers, show that if 
W is rational, then, in fact, it must be an integer. 
Conclude then that \/2 , VT , ^2 , V6 , for example, are all 
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irrational. (Hint: If is rational it may be expressed in 
lowest terms as Q/ R. If R # ± 1 it has a prime factor. If 
a prime divides a product of integers, it must divide one of 
them.) 

*3. We know from Lemma lc that if F is a field, so 
too is the set {a + b^2 \ a, b 6 F). What about the set 
{a + bfl \a, i £F}? Is it ever a field? Is it always a 
field? Give as complete an analysis as you can. 

4. Let f be a collection of fields. Show that 
the common intersection of all the fields in *5, written 
fl Fe? F, is itself a field. 

*5. Give an explicit description of the smallest field 
containing fl . (By the smallest field having a given 
property is meant the intersection of all fields with that 
property. Problem 4 guarantees that such an intersection 
is a field.) 

*6. If p is a prime and N is a positive integer > 2, 
give an explicit description of the smallest field contain- 
ing N {p . (This is a generalization of the previous 
problem.) 

7. Is {a + bf2 + cV3~ | a, b, c £ Q} a field? 

8. Give an explicit representation of the smallest 
field containing both & and V3~ . (Hint: Use one of the 
lemmas.) 

9. For the example treated after Lemma Id, give a 
different development of the same number from the 
rationals which results in a different sequence of field 
extensions. 

10. Let F be a field. Is the set {a 4- b\fk | a, b, k G F, 
k > 0} necessarily a field? (This differs from ¥(fk ) 
because in the present case k is not fixed, but rather it 
ranges through all positive values in F.) 

11. The following argument is occasionally found in 
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proofs of Theorem 1 : “Using only points in the plane of a 
field F, by a single fundamental construction it is only 
possible to construct points in the plane of some F(VaT ), 
k G F.” Criticize this argument or use it to give a simpler 
proof of Theorem 1. 

12. What else would you like to know? Spend a little 
time thinking about the material in this section and 
perhaps especially in the problems above. Can you 
anticipate the solution of any of the classical construction 
problems: doubling the cube, trisecting an angle, squaring 
the circle? Are you curious about anything? Do any 
questions occur to you that seem interesting? Make a 
short list of your questions and ideas, spending at least 
enough time on each to satisfy yourself that it is 
nontrivial. Keep these ideas in mind, jotting down others 
as they occur to you as you proceed through the material 
in subsequent sections. 


Section 1.2. Doubling the Cube 

We begin by recalling the classical problem of “doubling 
the cube”: Given a line segment representing the edge of a 
cube, is it possible to construct another line segment 
representing the edge of a cube of twice the volume of the 
first? If we take our unit of length to be the length of the 
given segment, then the desired segment must have length 
. So the problem is simply to determine whether ^2 is 
constructible. By Theorem 1, if yjl were to be 
constructible, there would have to be associated with it a 
sequence of quadratic field extensions. That this is 
impossible will follow easily from the following lemma, 
which also happens to give a complete answer to the 
question raised in Problem 3 of the previous section. 
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Lemma 2. Let F (yjk ) be a quadratic extension of a field 
F. If yi is in F(VF ), then \Jl must be in F itself. 

Proof. We are given that y/2 may be written in the 
form a + b^k , with a, b, k £ F, fk F. We want to show 
that b must be 0. The calculation 

2 = (a + bfk ) 3 

= a 1 + 3 a 2 bfk + 3 ab 2 k + b 3 kfk 

= [a 3 + 3 ab 2 k] + [3 a 2 b + b 3 k]fk 

implies that 3 a 2 b + b 3 k = 0, for otherwise we could solve 
for \/k which would then have to be an element of F. But 
then we also have 

(a - bfk ) = [a 3 + 3 ab 2 k] - [3 a 2 b + b 3 k]fk = 2, 

which gives us another cube root of 2, namely a - bfk . 
Since the function y = x 3 is strictly increasing, there is at 
most one real cube root of 2. This requires that b - 0. I 

And now the main result follows: 

Theorem 2. It is impossible to “ double the cube”. 

Proof. As noted in the introductory remarks, 
doubling the cube is equivalent to constructing yjl . If 
^2 were constructible, by Theorem 1 there would exist a 
sequence of fields Q = F 0 c F, c F 2 c • • • C F^, each a 
quadratic extension of the previous one, such that 
V? £ Fjy. The repeated application of Lemma 2 
obviously implies that yjl G Q, which contradicts the 
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irrationality of ^2 . (The irrationality of ^2 may be 
established very simply as in Problem 2 of the previous 
section.) a 

The basic idea of this section will be slightly generalized 
in the next section, where it will yield the solution of the 
angle trisection problem. 


PROBLEMS 

1. Can the cube be “tripled”? 

2. Can the cube be “quadrupled”? 


Section 13 Trisecting the Angle 

The classical problem involving angle trisection is: Given 
an arbitrary angle, is it possible to trisect it? (As usual, we 
are restricting ourselves to the use of a compass and an 
unmarked straightedge.) The answer is “no”, but one must 
be careful to understand what this answer means. There 
are certainly some particular angles that can be trisected, 
as, for example, the angle of 90°. (Why?) However, it is 
not possible to trisect an arbitrary angle. To put it another 
way, there exists at least one angle that cannot be 
trisected. We shall actually produce one such angle, in 
particular the innocuous-looking angle of 60°. 

Since a 60° angle can be constructed, if it could also be 
trisected, then a 20° angle could be constructed. But then, 
as illustrated in Figure 5, the number cos 20° would be 
constructible. It will be shown, however, that the number 
cos 20° is a root of a cubic equation which has no 
constructible roots, thus leading to a contradiction. 
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Mark off a unit length on one side and drop a perpendicular to the 
other. 

Figure 5 

From elementary trigonometric formulas, we have 
cos 3# = cos(2# + 0) 

= cos 26 cos 0 — sin 29 sin 0 
= (cos^# - sin 2 # )cos 0 — (2 sin 0 cos 0 )sin 0 

= cos 3 # - 3 sin 2 # cos # 

= cos 3 # - 3(1 - cos 2 #)cos # 

= 4 cos 3 # - 3 cos #. 

When # = 20°, cos 3# = £ , so the number cos 20° must be 
a root of the cubic equation 

8« 3 - 6m - 1=0. 

This equation is made a little simpler by the change of 
variable x = 2 u, which yields 

jc 3 — 3jc — 1=0. 

If cos 20° were constructive, then it would be possible to 
construct a root of this equation. As the first step in the 
proof that this is impossible, we make an observation 
which is very similar to that of Lemma 2: 
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Lemma 3. Let F(\/& ) be a quadratic extension of a field 
F. If the equation jc 3 — 3jc — 1=0 has a root in F(fk ), then 
it has a root in F itself. 

Proof. If the equation has a root in F(V^ ), we may 
denote it by a + b^fk , where a, b, and k are in F. If b = 0 
we are done, for a G F is a root. If 6 # 0, it will be shown 
that - 2a is a root, and this is obviously in F. 

Substituting the given root into the equation, we obtain: 

( a + b'lk ) - 3(a + bfk ) — 1 = 0 

a 3 + 3 a 2 b^k + 3 ab 2 k + b 3 kfk -3 a - ’ib'jk —1=0 

[a 3 + 3 ab 2 k - 3a - 1 ] + [3 a 2 b + b 3 k — 3 b]Jk = 0. 

But now we can conclude that [3 a 2 b + b 3 k - 3 b] = 0, for 
otherwise we could solve for Jk as an element of F. But 
this then also requires that [a 3 + 3ab 2 k - 3a — 1] = 0. 
After dividing the first of these two equations by b (which 
is not 0), we have 

3a 2 + b 2 k- 3 = 0 
a 3 + 3 ab 2 k — 3a — 1 = 0. 

Solving the first for b 2 k and substituting the result into the 
second, we obtain 

a 3 + 3a(3 - 3a 2 ) - 3a - 1 = 0 

— 8a 3 + 6a - 1 = 0 
( — 2a) 3 — 3(-2a) - 1 = 0 

from which we see that -2a, which is in F, is a solution 
of the original equation. ■ 
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Now the main result is easily accessible. 


Theorem 3. It is not possible to trisect an arbitrary 
angle. 


Proof. If it were possible to trisect an arbitrary angle, 
it would be possible to trisect, say, a 60° angle, which is 
itself constructible. But, as noted earlier, this would imply 
the constructibility of cos 20° and hence the constructibil- 
ity of a root of the cubic equation x 3 - 3x - 1 = 0. By 
Theorem 1, such a root would have to belong to a field 
situated in a sequence Q = F 0 c F, c F 2 c • • • c F w in 
which each field is a quadratic extension of the previous 
one. By a repeated application of Lemma 3, then, there 
would have to be a rational root of x 3 - 3x - 1 — 0. But 
this will now be shown to be impossible. 

If a rational number M/N, expressed in lowest terms, 
were a root of the equation, we would have 


A/ 3 

N 3 


3 M 
N 


- 1 = 0 , 


and so 


M 3 — 3MN 2 — N 3 = 0. 
Thus we could write both 

M 3 = N[3MN + N 2 ] 
and 


N 3 = M[M 2 -3N 2 ]. 

From the first we conclude that if N had a prime factor p, 
then p would divide M 3 and hence M. From the second 
we conclude that if M had a prime factor q, then q would 
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divide N 3 and hence N. Since M and N were assumed to 
have no factors in common, the only possible values for 
M and N are ± 1, and so the only possible rational roots 
are ±1. But by inspection, neither of these actually turns 
out to be a root. i 

It is important to realize that there is a difference 
between the questions: 

(i) Is it possible to construct the angle 91 

(ii) Given the angle 36, is it possible to construct the 
angle 91 

The second question is the one addressed by the angle 
trisection problem, and it only reduces to the first 
question in cases where the angle 39 is itself constructible. 
It happened to be convenient to use such a case in the 
proof of Theorem 3, but the reader should know that 
there are angles for which the answer to question (i) is 
“no” but the answer to question (ii) is “yes”. To analyze 
these two different questions, it is useful to convert them 
into more algebraic terms. Remembering that in the 
presence of a unit segment we have available an angle if 
and only if we have available a segment whose length is 
its cosine, we can rephrase these questions as follows: 

(i') Is cos 9 constructible? 

(ii') Is cos 9 constructible if, in addition to a unit 
segment, we are also given a segment of length cos 3 91 

In Problem 4, the reader is asked to show the existence of 
an angle for which these questions have different answers. 


PROBLEMS 

1. It is obvious that by repeated bisection it is pos- 
sible to divide an arbitrary angle into 4 equal parts. Show 
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how this may also be deduced algebraically from the 
equation relating cos 4 9 and cos 9. 

2. Discuss the validity of the following proof of 
Theorem 3: “By the continuity of cos 9, there exists an 
angle 9 0 such that cos 9 0 = ( 3 \/2 )/2. By Theorem 2, cos 9 0 
is not constructible. Hence 30 o is an angle that cannot be 
trisected.” 

*3. Prove that it is not possible, in general, to quintisect 
an arbitrary angle, i.e., to divide it into five equal parts. 

4. Is the difficulty in Problem 2 real or imagined? 
Show that there are trisectible angles 3 9 such that 9 is not 
constructible. 


Section 1.4. Squaring the Circle 

The third classical construction problem, that of squaring 
the circle, turns out to be much more difficult than the 
previous two. The question is: Given a circle, is it possible 
to construct a square with the same area as the circle? If we 
take as our unit length the radius of the circle, its area is 
•n, and so the construction of the appropriate square 
amounts to the construction of a segment of length 'fn , 
which would necessitate the constructibility of it itself. It 
will be shown, however, that it is not constructible, and 
thus that it is not possible to square the circle. The proof 
that it is not constructible will consist of two main parts: 
first, that every constructible number is the root of some 
polynomial equation with rational coefficients; second, 
that it cannot be a root of any polynomial equation with 
rational coefficients. The first part, which is fairly simple, 
will be covered in this section, The second part, which is 
much more difficult, will be proved in Section 1.7, the 
intervening sections being used to introduce some ideas 
needed there as well as later on. 
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We shall make use of the following simple lemma: 

Lemma 4. Let F (fk ) be a quadratic extension of a field 
F. If a is a root of a polynomial equation of degree n with 
coefficients in F(fk ), then a is a root of a polynomial 
equation of degree 2 n with coefficients in F. 

Proof. By hypothesis, a is a root of an equation of the 
form 

(a„ + bjk)x n + (a„_, + b„_ l fk)x n ~' 

+ • • • + (flo + ) = o, 

where for all i, a, and b, E F, and furthermore, a„ 
and b n are not both zero. Transferring the terms involving 
fk to the other side of the equation, we obtain 

a„x n + a n _ l x n ~' + ■ ■ • + a 0 

= - fk(b n x" + b n _tX n -'+ ■ • • +b 0 ). 

Upon squaring both sides and collecting all terms together 
again on the left, we have a polynomial equation of which 
a is also a root. Its degree is 2 n, because the coefficient of 
x 2n , the highest power present, is a 2 - kb 2 , which is not 0 
by the assumption that fk ^ F. ■ 

Using this lemma, we are now able to prove: 

Theorem 4. Every constructible number is the root of 
some polynomial equation with rational coefficients. 

Proof. If the number a is constructible, then by 
Theorem 1 there is a sequence of fields Q = F 0 c F, 
C • • • CF„ such that a EF n and for each j, 0 < j 
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< N — l, Fj +i is a quadratic extension of Fy. Now, a 
satisfies a polynomial equation of degree 1 with coef- 
ficients in F N , namely, the equation x - a = 0. By the 
repeated application of Lemma 4, a satisfies a polynomial 
equation of degree 2' with coefficients in F A ,_ ( . Hence for 
i = N, we see that a satisfies a polynomial equation of 
degree 2 N with coefficients in Q. ■ 

In view of Theorem 4, once it has been shown (in 
Section 1.7) that i t is not the root of any polynomial 
equation with rational coefficients, we shall have a 
complete proof of the result stated earlier, namely: 

Theorem 5. It is not possible to “ square the circle ”. 

PROBLEMS 

1. The number ^2 + \/T, being constructible, has 
associated with it a sequence of quadratic field extensions 
Q = F 0 cF,c • • • cF, such that V2 +VT EF N . De- 
termine the smallest such number N. Using this value, 
what is the degree of the polynomial whose existence is 
proved in Theorem 4? Following the procedure in the 
proof of Theorem 4, determine this and any intermediate 
polynomials. 

2. Does there exist a polynomial equation with 
rational coefficients and of degree less than that 
determined in Problem 1 such that V2 - + V3 - is a root? 


Section 1.5. Polynomials and Their Roots 

It now becomes essential for us to expand our 
considerations to the set C of complex numbers. These 
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are, of course, numbe rs of the form a + bi, where a and b 
belong to R and i =V~ 1 . In fact , we may regard C as a 
quadratic extension of R, C = R(V- 1 ), now that V— 1 is 
in the set we are considering. The complex numbers form 
a field, which can be shown by verifying that the proof of 
Lemma lc is also valid for k < 0; in this case k = — 1. 

From now on, the term field will refer to a subset of C 
containing the number 1 and closed under the rational 
operations, which definition certainly includes all the fields 
we have been considering so far. By a polynomial over a 
field F is meant a function of the form a n x n + 
a n _ x x n ~ x + • • • + a { x + a 0 where the coefficients a j are 
elements of F and where n is a non-negative integer. The 
domain of the independent variable x will be understood 
to be C except where stated otherwise. The degree of such 
a polynomial is the highest exponent of x in a nonzero 
term. The degree of the polynomial identically equal to 0 
is undefined. We shall represent polynomials by conven- 
tional function notation. By a root of a polynomial is 
meant a value of x for which the function value is 0. 

The following theorem, which we will not prove, is 
called the Fundamental Theorem of Algebra: 

Theorem 6. Every polynomial over C of degree > 1 has 
at least one root (in C). 

The proof of this theorem is most easily accomplished 
using elementary complex variable theory, and the reader 
may find it in almost any book on that subject. More 
elementary proofs are cited in the “References and Notes” 
section at the end of this chapter. Included as special 
cases, of course, are polynomials over R or over Q. 

If f(x) and g(x), which we may also refer to simply as / 
and g, are polynomials, we say that g(x) divides f(x) if 
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there is a polynomial q( x) such that f(x ) = g(x)q(x). If 
f(x) and g(x) are polynomials over a field F, then by the 
usual long division of polynomials as in high school 
algebra, it follows that q(x) is also over F. Clearly, for g to 
divide /, the degree of g must be less than or equal to the 
degree of /. 

It is an elementary observation that if / is a polynomial, 
the number r is a root of / if and only if (jc — r) divides /. 
For, as long division shows, the remainder upon dividing/ 
by (jc — r) is simply f(r). (See Problem 1.) If r is a root of 
/, we define the multiplicity of the root r to be the largest 
integer m such that (x - r) m divides /. This leads to the 
following refinement of Theorem 6: 

Theorem 7. Every polynomial over C of degree n has 
exactly n roots, each counted as many times as its 
multiplicity. 

Proof. By induction on n. When n = 0, the polynomial 
is a nonzero constant, and hence has no roots. If / is a 
polynomial of degree n > 1, then by Theorem 6 it has a 
root r. Factoring (x - r) out of f(x), we are left with a 
polynomial of degree n — 1, to which the inductive 
hypothesis applies, giving another n - 1 roots (some of 
which might coincide with r or with each other). ■ 

Most of the material encountered so far in this section 
is probably somewhat familiar to the reader. But now we 
want to take a closer look at certain sets of numbers 
appearing as the roots of polynomials. In particular, we 
say that a number a (which, as usual, may be complex) is 
algebraic over the field F if there is some polynomial (not 
identically 0) over F of which a is a root. In cases where F 
is not specified, we understand it to be Q. Thus we would 
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say that 3, and \Jl are algebraic, as they are roots, 
respectively, of the following polynomials over Q : x - 3, 
x 2 + 1, x 3 — 7. Any number which is not algebraic is 
called transcendental. 

To phrase our current problem in this language, we 
have shown (Section 1 .4) that all constructible numbers are 
algebraic. To complete the proof of the impossibility of 
squaring the circle, we want to show that -n is 
transcendental. 

Let us now make a very important observation, which is 
not at all obvious: 

Theorem 8. If F is a field, then the set of all numbers 
algebraic over F also forms a field. 

Proof. Suppose that a and b are algebraic over F, and 
thus they satisfy, respectively, polynomial equations 

a n x n + a„_ + • • • +a 0 = 0, 

b m x m + b m _ x x”~' + • • • + 6 o = 0, 

where all the coefficients are in F and a n , b m # 0. It is 
easy to see that - b is algebraic over F, as it satisfies 

(-l) m b m x" + (-\)"-'b m _ l x"'-'+ ■ ■■ + b 0 = 0; 
and if b # 0, 1/6 is algebraic over F, being a root of 
b m + b m - ,*+ • • • +b 0 x m = 0. 

Thus, it suffices to show that a + b and ab are algebraic 
over F, because subtraction and division reduce to these 
cases. One more observation: using the original equations, 
a ” and b m can be expressed in terms of lower powers of a 
and b respectively; for example, a n = -(a n _ ] a n ~ l + 
• ' ' + «o )/«„• 
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Now consider the table of mn numbers: 


1 

a 

u 2 

• a""' 

b 

ab 

a 2 b ■ • 

• a n -'b 

b 2 

ab 2 

a 2 b 2 

■ a n ~'b 2 . 

b m-\ 

ab m 

a 2 b m -' ■ - 

■ a n -'b m -' 

Listing them 

in 

any order, and 

calling them c„ c 2 , 


. . . c R , where R = mn, we note that for every i, ac, can be 
written as a linear combination of all the c/s: 

ac i = 4i c i ^a c i + • • * + d,R c R , 

where the coefficients d tj e F. Actually, most of the dy's 
are 0; for example if c, is one of the entries not in the last 
column of the table, ac i is just the single entry to its right. 
If c, is one of the entries in the last column, then ac , has 
the form a n b k , in which a" is then replaced by its 
equivalent in terms of lower powers of a. In any case, the 
above representation is possible. Denoting by C the 
column vector whose R entries are the c/s, and by D the 
R X R matrix of the 4/s, we can rewrite the above 
equations in matrix form, 

aC = DC. 

Similarly, there is a matrix E such that 
bC = EC. 

Thus we have 

(a + b)C = (D + E)C. 

Since C # 0, this implies that a + b is an eigenvalue of 
the matrix D + E. But then a + b is a root of the 
characteristic polynomial, det(Z) + E — xl), where / is the 
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identity matrix. Since this is a polynomial over the field F, 
we conclude that a + b is algebraic over F. 

For the product ab, observe that 

(ab)C = a(bC) = a(EC) = E(aC ) = E(DC) = ( ED)C , 

and thus ab is an eigenvalue of ED. So by the same 
reasoning as above, ab is algebraic over F. ■ 

It is useful to note that if / is a polynomial over a field 
F, division of / by its leading coefficient (the coefficient of 
the highest power of x) yields another polynomial over F 
of the same degree and with the same roots as /. The new 
polynomial is monic, meaning that its leading coefficient is 
1. It is easy to see that a monic polynomial can be written 
as the product JI,(* - r,), where the r’ s are its roots, each 
occurring as often as its multiplicity. By a similar 
procedure, for any polynomial over Q there can be found 
a polynomial with integral coefficients having the same 
degree and roots as /. 


PROBLEMS 

1. a. If / is a polynomial of degree n > 1 over a field 
F, and if r is any element of F, show that there is a 
polynomial q of degree n — 1 over F and a number R in F 
such that /(x) = (x - r)q(x) + R. 

b. Show that, in the above, R = f(r). 

c. Let / be a polynomial over C. Show that r is a 
root of / if and only if (x - r) is a factor of /. 

2. If / is a polynomial with integral coefficients, it is 
sometimes useful to know whether it has any rational 
roots. Suppose M/ N is a rational root expressed in lowest 
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terms. There is a simple necessary condition connecting M 
and N with the first and last coefficients of the poly- 
nomial. Find it. (This result is called the Rational Roots 
Theorem. It gives a necessary condition that can be used 
to narrow down the candidates for a rational root to a 
small collection which can each be tested.) 

3. Use the result of Problem 2 to find any rational 
roots of each of the following: 

a. 8x 3 - 6x - 1 

b. jc 3 — 3jc — 1 

c. 2x 3 + 3x 2 - x - 1 

d. 3x 5 — 5x 3 + 5x — 1 

e. x 3 + Ax 2 + x - 6. 

4. If F is a countable field, show that the field of all 
numbers algebraic over F is also countable. 

5. Show that the result of Problem 2 of Section 1.1 is 
just a special case of Problem 2 of this section. 

6. If r is a root of multiplicity > 2 of a polynomial /, 
show that f(r) = 0, where /' denotes the derivative of /. 


Section 1.6. Symmetric Functions 

A very useful tool in the study of polynomial equations 
is the concept of a symmetric polynomial in several 
variables. To begin, a polynomial over a field F in the 
variables x,, x 2 , . . . , x n is a function of the form 

P(x„ x 2 , . . . , x„) = ^ a ,„ h iA'x‘ 2 1 •••*«" 

'1. '2, • • • > 

where the i’s are non-negative integers, the coefficients, 
the a’s, are elements of F, and the sum contains only a 
finite number of terms. The degree of a term in such a 
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polynomial is the sum of the exponents of the x/s in the 
term, and the degree of the polynomial is the highest degree 
of any term. As in the case of polynomials in one variable, 
degree is not defined for the polynomial which is 
identically 0. 

As examples of such polynomials, x] + x,xf is a 
polynomial over Q in two variables and of degree 4; 
V T x,x 2 x 3 is a polynomial over Q(V2" ) in three variables 
and of degree 3; > s a polynomial over Q in n 

variables and of degree 1. 

Some polynomials in several variables have a curious 
property; they remain unchanged by any permutation of 
the variables! For example, xj 4- x 2 has this property, 
because if jc, and x 2 are interchanged, the result is 
x 2 + xf, which is the same polynomial that we started 
with. Similarly, x,x 2 x 3 also has this property; it is 
unchanged by any permutation of the variables. However 
x] - x 2 does not have this property; the interchange of x, 
and x 2 results in the negative of the original polynomial. 
We say that a polynomial in n variables is symmetric if it 
is unchanged by each of the n\ permutations of the 
variables. That is, if <f> is a one-to-one mapping from the 
set of integers 1,2, ... ,n to itself (a permutation of the 
first n integers), for P to be symmetric we require that 
^(^*( 1 )- • • • > *<>(«)) = p ( x v x 2 , . . x„). So we would 

say that x\ + x\ is a symmetric polynomial in two 
variables, whereas xj - x\ is not. 

Certain important symmetric polynomials in n variables 
arise as the coefficients of the powers of x in the 
expansion of JJ". ,(x - x,). Writing this expansion as a 
polynomial in x, 

I! (x — Xj) = x” — a,x" _ 1 + o 2 x n ~ 2 — • • • +(-l)"°„, 

i- i 
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it is not hard to see that 

o, = x, + x 2 + ■ ■ ■ + x„ 

°2 = * 1*2 + * 1*3 + ‘ • ' + * 2*3 + ' ' ' + *„- 1 *„ 

0, = sum of all products of i different xf s 


0„ = x,x 2 • • • 

These 0,’s are obviously symmetric polynomials in 
x v x 2 , . . . , x„. From these calculations we are able to 
make some useful observations about the roots of an wth 
degree polynomial in x. If r x , r 2 , . . . , r n are the roots of 
a„x n + + • • • + a 0 , then the sum of the roots 

equals -a n _ x /a n and their product equals (-l)"a 0 /a n , 
since 


II (x - r) = x n + ^ x" ^ + —— ~ x n 2 


1-1 


+ 


(Elementary applications of these facts are pursued in the 
problems.) Symbolically, of course, the sum of the roots is 
0,(r„ r 2 , , r n ) and their product is 0„(r„ r 2 , . . . , r n ). 

The 0,’s just defined are called the elementary symmetric 
functions, and it may be surprising to learn that every 
symmetric polynomial in n variables can be expressed as a 
polynomial in these elementary symmetric functions! This 
result, which will be proved below, is called the 
Fundamental Theorem on Symmetric Functions. As an 
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example, we observe in the case n- 3 that 
x] + x\ + xj = (x, + x 2 + x 3 ) 2 - 2 (x,x 2 + x,x 3 + x 2 xf) 

= of — 2 a 2 - 

Theorem 9. Every symmetric polynomial P over F in 
x { , x 2 , ... ,x„ can be written as a polynomial Q over F in 
the elementary symmetric functions. If P has all integral 
coefficients, then so too does Q. The degree of Q is less than 
or equal to the degree of P. 

Proof. Let M be arbitrary but fixed, M > 1. We shall 
prove the result for all polynomials in x x ,x 2 , ... , x n of 
degree less than or equal to M. 

To begin, we associate with each term of the symmetric 
polynomial P the n-tuple containing the exponents of x v 
x 2 , ... ,x n respectively (some of which may be 0). A 
linear ordering of these /i-tuples can be obtained by 
defining (/,, i 2 , ...,/'„)> (J v j 2 , • • . ,j„) if in the first 
position from the left in which they differ, say position k, 
i k > j k . (This is the same idea as alphabetical ordering.) 
With respect to this ordering, the term of P with the 
highest n-tuple of exponents is called the highest term. 
(This can be different from a term of highest degree.) For 
all such polynomials of degree < M, there are clearly only 
a finite number of possible exponent n-tuples less than the 
exponent n-tuple of the highest term. We denote this 
number by N, and our proof will be by induction on N. 

For example, if M-4 and P(jr ( , x 2 , x 3 ) - x, + x\ + 
x 3 , then N = 31, for the highest exponent 3-tuple is (3,0,0), 
and this may be followed, in decreasing order among all 
possible 3-tuples in polynomials of degree < 4, by: 
(2,2,0), (2,1,1), (2,1,0), (2,0,2), (2,0,1), (2,0,0), 
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(1, 3, 0), (1, 2, 1), (1, 2, 0), , (0, 0, 1), (0, 0, 0), of which 

there are a total of 31. 

To prove the result for N = 0, we observe that in this 
case the polynomial is a nonzero constant, which can also 
be considered as a (constant) polynomial in the a,’s. 

Assuming now that the result is true for N <, R, we 
shall prove it for N = R + 1. If the highest term of P is 
ax{'x 2 1 ■ ■ ■ x' n *, we first observe that /, > i 2 > ■ • ■ > i„. 
For, if there were a j with L < ij + ,, the permutation which 
simply interchanges Xj and jc +1 would result in a higher 
term, which, since P is symmetric, must be included in P. 
Thus we can construct the polynomial Q = aa['~‘ 2 a^~ i} 
• ■ • which is a polynomial in the a’s of degree 

less than or equal to the degree of P. If for each o, we 
substitute its expression in x,, x 2 , . . . , x n , and then 
expand out in the form of a polynomial in the jc/s, we find 
that the highest term is precisely the same as the highest 
term of P. To see this, observe that the highest term of a 
product of polynomials is the product of their highest 
terms (see Problem 5). Thus the highest term in the 
expansion of Q is ax{'~‘ :i (x- [ x?)' 2 ~‘'> • • ■ (x,x 2 • • • x„)'" 
= ax['x± 2 • • • x„'". It is also easy to see that every term of 
this expansion has the same degree, and thus, as a 
function of the x,’s, Q has degree less than or equal to the 
degree of P. Consequently, the difference P = P — Q is 
either 0, in which case we are done, or else another 
symmetric polynomial with a lower highest term. In this 
case, _the inductive hypothesis applies to P, and so 
P = P + Q which is of the required form. If P has integral 
coefficients, then so too does P. i 

An important consequence of Theorem 9 which will be 
applied repeatedly in our treatment of the roots of 
polynomials is the following: 
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Corollary 9a. Let f(x) be a polynomial over F of 
degree n with roots r,, r 2 , . . . , r n . If P is any symmetric 
polynomial over F in n variables, then P(r i ,r 2 , . . . , r n ) is an 
element of F. 

Proof. By Theorem 9, P may be expressed as 
a polynomial in the elementary symmetric functions. 
But if f(x) = a n x n + a n _ l x n ~ 1 + • • • + a 0 , then 
o,( r i> r 2 > • • • t r„)= ±a n _Ja„, which is in F. Hence 
P(r l5 r 2 , . . . , r n ) is itself in F, since F is closed under the 
operations used to evaluate polynomials (addition, 
subtraction, and multiplication). i 

As an example of this result, consider the polynomial 
f(x) = x 1 + 2x - 1, which is over Q, and which has as 
roots /■,= — 1+V2 and r 2 =-\-fl. Taking any 
symmetric polynomial in two variables, say P(x v x 2 ) 
= + x\, the corollary asserts that P(r,, r 2 ) will turn out 

to be rational. We can verify this directly: 

P(r.,r 2 ) = (-l+V2) 3 +(-l-^) 3 
= (-l + 3\£’-6 + 2VT) 

+ ( — 1 — 3V5” — 6 — 2V5" ) 

= -14, 

which is indeed in Q. 

Using Corollary 9a, it is possible to give another proof 
that the set of all numbers which are algebraic over one 
field is a field itself (Theorem 8). This is pursued in 
Problem 6. Corollary 9a also leads to the more general 
result, Corollary 9b, which is somewhat technical but 
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forms an essential link in our upcoming proof that rr is 
transcendental. 

Corollary 9b. Let f be a polynomial of degree n over 
F with roots r,, r 2 , . . . , r n . Let R be fixed, 1 < R < n. Let 
s,, s 2 , . . . , s m be the collection of all sums r ( + 
#•, + ••• + r i/t , where the R subscripts are all different. 
Then there is a polynomial over F whose roots are precisely 
■Sj) S 2 , • . • ) s m . 

Proof. The candidate is certainly ,(x - s t ); one 
merely has to show that if this is expanded out, the 
coefficient of each power of x is an element of F. Since 
the coefficients are, except possibly in sign, the elementary 
symmetric functions of m variables, evaluated at the sf s, it 
will certainly suffice to show that any symmetric 
polynomial over F in m variables, evaluated at the s’ s, is 
an element of F. Let P(x v x 2 , . . . , x m ) be such a 
symmetric polynomial. We consider n new variables, 
u u u 2 , and we replace in P each x i by a certain 

specific sum of R different u’s, each x, being replaced by 
a different such sum. Then we claim that P becomes a 
symmetric polynomial over F in the variables 
u 2 , . . . , To see this, we first note that any permutation 
of the u’s may be accomplished by a sequence of 
transpositions (single interchanges of two of the s). See 
Problem 7 for a proof. Thus it remains to show that P is 
invariant under the transposition interchanging, say, n, 
and Uj. In this case the x,’s containing neither or both of 
these as summands are unchanged, and the remainder 
are permuted. Thus, by the symmetry of P in the x’s, P is 
left unchanged. Plugging in the r’s for the u’s, we get 
P(s u s 2 , . . . , s m ), which by Corollary 9a is an element 
of F. i 
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PROBLEMS 

1. The cubic 2x 3 - 3x 2 — 32.x — 15 has two roots 
whose sum is 2. Find all the roots. 

2. Find the product of all the roots of 2x 5 — 3x 4 + 
7x 3 - 7x 2 + 3x - 1. Are any of the roots rational? 

3. Find the value, expressed in terms of the co- 
efficients, of the sum of the squares of the roots of 
a n x n + a„_ ] x n ~ 1 + • • • + a 0 . 

4. Find the value, expressed in terms of the co- 
efficients, of the sum of the reciprocals of the roots of 
a n x" + a„_ l x"~ l + • • • + a 0 , where a 0 # 0. 

5. Let P x and P 2 be two polynomials in n variables. 
Show that the highest term of the product P i P 2 is the 
product of the highest terms of P ] and P 2 . (The word 
“highest” refers to the ordering on the exponent n-tuples.) 

6. (Alternate proof of Theorem 8.) Suppose a and b 
are algebraic over a field F. Then a is a root of some 
polynomial over F, whose complete set of roots may be 
written a = a v a 2 , . . . , a n , where each root is listed as 
many times as its multiplicity. Similarly for b, giving rise 
to roots b = b v b 2 , . . . , b m . Apply the theory of sym- 
metric functions twice successively to the expression 

n m 

UU(x-a,- bj) 

i=\ j-\ 

to conclude that a + b is algebraic over F. In a similar 
way, show that ab is also algebraic over F. 

7. Prove that every permutation of the variables 
x t , x 2 , . . . , x n can be accomplished by a succession of 
transpositions, where a transposition is the single 
interchange of two of the variables. (This was used in the 
proof of Corollary 9b.) 
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8. With reference to the preceding problem, there may 
be many different sequences of transpositions resulting in 
the same permutation. Show that for a given permutation, 
all sequences of transpositions leading to it have the same 
parity; that is, either they all contain an even number of 
transpositions or else they all contain an odd number. (In 
this way, permutations can be classified as “even” or 
“odd”.) 


Section 1.7. The Transcendence of n 

The proof of the transcendence of m relies on a wide 
collection of mathematical techniques, ranging from 
calculus to elementary number theory. In this respect, in 
addition to the fundamental importance of the result, it 
should be of interest and worthy of careful study. 

Use will be made of the function e 2 for z complex, with 
which the reader may be unfamiliar; so we begin with 
some comments about this function. If z is a complex 
number it may be written in the form z - x + iy, where x 
and y are rpal. The numbers x and y are called the real 
and imaginary parts of z, respectively. The expression e z is 
then defined to be the complex number e x (cos/ + 
i sin y), which involves only the usual exponential and 
trigonometric functions of real variables. From the 
definition it follows that e m = e°(cos tt + i sin w) = — 1. 
Also, it can be shown rather easily (Problem 1) that e z 
obeys the usual laws of exponents; in particular, 
e z 'e Zl = e z ' +z *. These are the only properties that we shall 
use here. 

One number theoretic lemma will also be useful: 

Lemma 10. The product of any n consecutive positive 
integers is divisible by n\. 
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Proof. Calling the integers m + l, m + 2, . . . , m + n, 
we wish to show that the quotient (m + l)(m + 2) 
• • • (m + n)/n\ is an integer. Multiplying it by 1, written 
as m\/m\, puts it in the form ( m + ri)\/m\ n\, which is 
simply one of the binomial coefficients, and hence an 
integer. (Cf. Problem 2.) i 

We are now ready to prove: 

Theorem 10. tt is transcendental. 

Proof. Suppose it is algebraic. This will eventually lead 
to a contradiction. Since i is also algebraic, m is algebraic 
by Theorem 8, and thus there is a polynomial over Q 
whose roots are m = r„ r 2 , . . . , r„. Since e r ' = — 1, we 
have 

(<?'■+ l)(e rj + 1) - • • ( e r -+ 1) = 0, 

which we can multiply out, combining only the terms with 
0 exponents, to obtain 

e q ' + e qi + • • • + e qm + k = 0 

where k is a positive integer, being a sum of l’s, and the 
q’ s, some of which may be equal, are all different from 0. 

Claim 1. There is a polynomial with integral coef- 
ficients whose roots are precisely q x , q 2 , . . . , q m . 

Proof. In expanding out the product as described 
above, the set of terms we obtain before combining any- 
thing consists of the single number 1 and the number e 
raised to various powers. The entire collection of such 
powers is just the set of all sums of R distinct r ’ s (distinct 
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by subscript at least), as R ranges from 1 to n. By 
Corollary 9b, for each fixed value of R, there is a 
polynomial over Q with precisely those sums as roots. The 
product over all R of these polynomials is practically what 
we want. We simply need to modify it slightly first by 
dividing through by the appropriate power of x so as to 
eliminate the 0 roots, and then by multiplying through by 
an appropriate integer to make all its coefficients integral. 


We shall call this polynomial g, and we shall write it as 
g(x) = ax m + a m _ x x m ~' + • • ■ + a 0 . (The subscript has 
been left off the leading coefficient for the sake of 
convenience.) Its roots are precisely q x , q 2 , ■ ■ ■ , q m . 

Now define an auxiliary polynomial / by 


fix) 


a s x p 1 [g(x)] /> , 

(P ~ 1)! 


where p is a prime to be chosen appropriately later and 
s = mp - 1. The degree of / is p — 1 + pm = s + p. The 
polynomial / is a complex-valued function of a complex 
variable, and it is necessary for us to make use of some of 
the differentiation theory for such functions. The results 
needed are formally identical with results from elementary 
calculus, but of course they require separate justification. 
These results will be summarized in the text, their actual 
development being relegated to the problems. 

For complex functions of a complex variable, deriva- 
tives are defined by the same kind of limit of a difference 
quotient as in the real case. The usual rules for the 
derivative of sums, differences, products, and quotients 
carry over, as do the formulas for the derivatives of 
polynomials and the exponential function. (See Problem 
3.) In this context, denoting the y'th derivative of / by / (j) , 
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we define 

F(x) = f(x) + f a) (x) + f 2 \x) + ■ ■ ■ + f s+p) (x). 

Since f < - s+p+i \x) = 0, we find that the derivative of 
e~ x F(x) is just -e~ x f(x), for 

iU-'FW] 

= e -Jt [/C)(jc) +/< 2 )(x) + • • • +/ <1+/>) (*)] 

-e- x [f(x) + f l \x) + f 2 \x)+ ■ ■ ■ +f is+p) (x)] 
= -e~ x f(x). 

This fact will be used in what follows. 

Next let us regard x as an arbitrary but fixed complex 
number, and look at the function given by 

<j>(u) = e~ xu F(xu), 

where u is a real variable. This is a complex-valued 
function of a real variable. Differentiation and integration 
of such functions are defined by applying these operations 
separately to the real and imaginary parts, both of which 
are simply real functions of a real variable. It is very 
simple to verify that the Fundamental Theorem of 
Calculus, in the form 

<j>(b) - <p(a) = f tf>'(u)du, 

* a 

carries over to such functions (Problem 4). 

It is our intention to apply this theorem on the interval 
[0, 1]. To this end, we need to calculate </>'(»)• This may be 
done by a version of the chain rule whose justification in 
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the present context is treated in Problem 5. Using the 
intermediate variable 


z = ux 


we may write 


It follows that 


<K“) = e 


- 

= — xe~ ux f(ux). 

Now, from the Fundamental Theorem of Calculus, we 
obtain: 

<#»(!) ~<>(0)= f <t>'(u)du, 

J o 

e~ x F(x) - e _0 F(0) = f - xe~ ux f(ux)du, 

F(x)-e x F( 0) = f - xe^~“^ x f(ux)du. 

Letting x successively assume the values q u q 2 , ■ . ■ , q m 
and then adding the results, we have 

S^) + *F(0)--2 / q/'-^f(uqj)du. 

j- 1 7=1 ■'O 


Eventually we shall establish the desired contradiction by 
choosing the prime p so as to make the left side of this 
equation a nonzero integer and the right side arbitrarily 
small. 
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Claim 2. ^JLiFty) is an integer divisible by p. 


Proof. Using the definition of F, we have 

m m s+p s+p m 

2 n %) =22 /%) =22 

j- 1 j= 1 r = 0 r = 0 /— 1 

For 0 < r <p, 

m 

2 /%,) = o, 

7-1 

since by the definition of /, f (r \qf) has at least one factor 
g(qj), which is 0. Thus, 

m s+p rn 

2 F(qj) = 2 2 f r \qj)- 

j - 1 r -p j - 1 

Now let r be arbitrary but fixed, p < r < s + p. Since 
a ~ s (p - 1)! /( jc) is a polynomial with integral coefficients, 
the coefficient of each nonzero term of its rth derivative 
contains the product of r (and hence at least p) 
consecutive integers; therefore, by Lemma 10, it is 
divisible by/)!. Thus each coefficient of p r \x) is divisible 
by pa s , which implies that the expression 





is a symmetric polynomial with integral coefficients in the 
m variables w„ u 2 , . . . , u m . By the Fundamental Theorem 
on Symmetric Functions it may be expressed as a 
polynomial in the elementary symmetric functions and 
having integral coefficients. We now substitute in for the 
variables the values q x , q 2 , ■ . . , q m , and we recall that 
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each elementary symmetric function a t {q x , q 2 , . ■ . , q m ) 
— ±a m _i/a. Since the degree of the polynomial in the 
ff/s is < s, each term has the form of an integer divided by 
some power of a not greater than s. Thus 

m 

2 f r X<ij) = p 

j‘ I 

where the bracketed term is an integer. Since this holds 
for each r such that p < r < s + p, the sum over those r is 
an integer divisible by p. Since this sum equals the original 
expression, the proof of the claim is complete. 

Claim 3. For p sufficiently large, kF( 0) is an integer 
not divisible by p. 

Proof. The terms of F( 0) fall into three categories. The 
terms / <r) (x) for 0 < r < p — 2 all contain a factor x, and 
hence are all 0 at x = 0. The terms f (r \x) for 
p < r < 5 + p are polynomials with integral coefficients 
each divisible by p, as shown in the proof of the previous 
claim. Thus / w ( 0), their constant terms, are integers 
divisible by p. For kF( 0) not to be divisible by p, 
therefore, it suffices to guarantee that the one remaining 
term, kf (p ~ x \ 0), not be divisible by p. From the definition 
of /, the only term contributing to 0) is 

(a s a&/(p— \)\)x p ~ x . Hence kf (p ~ ^(O) = ka s a^. Con- 
sequently, if p is any prime greater than k, a, and a 0 , it 
cannot divide this product. 

Now we return to our earlier equation, in connection 
with which the two previous claims were undertaken: 

'2F(q j ) + kF(0)=- 2 f q/'- u) *f(uqj)du. 
j - 1 J= i 0 


a '% P *W 
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We have shown (Claim 2) that the first term on the left is 
an integer divisible by p and (Claim 3) that for all p 
sufficiently large the second term on the left is an integer 
not divisible by p. Thus the left side is a nonzero integer. 
We will now show that for p sufficiently large, the right 
side may be made arbitrarily small, and hence not equal 
to any nonzero integer. This contradiction will complete 
the proof. 

Now, 


m r 1 

- 2 J <i/'- uH K u qj)du 

j =\ J 0 


J r 1 m 

n : I 


y-i 


a '( u %Y '[ £(«?,)]' 

(P - 1)! 


du 


< 



[a m (uqj)g( u qj)] p 

(p ~ O' 


q je (l 'g(uqj) 


\du 


since a s — a mp ~' i = (The vertical bars on 

the right side of the inequality refer to the magnitudes, or 
moduli, of complex numbers. See Problem 3. The usual 
rules for absolute values of integrals and sums carry over 
directly to this case.) Let B be a uniform bound over 
1 < j < m of the continuous functions \a m {uq ] )g(uq J )\ on 
the closed interval 0 < u < 1 . Let C be a bound on the 
continuous function 


m 


2 \q )e (l “ )% a 
J - ' 


m -'g("qj)\ 


on the same interval. Then the original expression is 
bounded by CB p ~ ] /(p - 1)!. Since this is the p\h term in 
the (convergent) Taylor series around 0 of Ce B , it must 
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approach 0 as /> — > oo. Thus it can be made arbitrarily 
small for p sufficiently large. ■ 

As noted in Section 1.4, the proof of Theorem 10 also 
completes the proof that it is impossible to “square the 
circle”. 


PROBLEMS 

1. Prove that e z 'e Zl = e z,+Z2 . 

2. Prove that the binomial coefficients are in fact 
integers. 

3. The modulus of a complex nu mber z = x + iy, 
where x and y are real, is given by |z| = \x 2 4 - y 2 .If F(z) 
is a complex function of a complex variable, then the 
definition of the statement lim z _^ o F(z) = L is that for 
every € > 0 there exists a 8 > 0 such that |F(z) - L\ < e 
whenever 0 < \z - z 0 | < 5. This is formally identical with 
the real case, the difference being the use of moduli 
instead of absolute values. If G(z) is a complex function 
of a complex variable, the derivative of G is the new 
function whose value at any point z 0 is given by 


G'(z 0 ) = lim 

Z—*Zq 


G{z) - G(z 0 ) 
z-z 0 


(As in the real case, some complex functions are 
differentiable and some are not.) Show that the usual rules 
from elementary calculus carry over in the complex case 
for each of the following: 

(a) the derivatives of sums, differences, products, and 
quotients; 

(b) the derivatives of polynomials; 

(c) the derivatives of e z and e~ z . 

4. Let <fi(u) be a complex-valued function of the real 
variable u on the interval [a, b], and suppose its real and 
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imaginary parts are given by differentiable functions r(u) 
and s(u), respectively, so that 

<f>(u) = r(u) + is(u). 

Then the derivative <t>'(u) is defined by the equation 
<t>'(u) = r\u) 4- is'(u). 

Similarly, the integral f h a <p'(u)du is defined by the equation 

rb r b rb 

I <f>'(u)du = I r'(u)du+ i s'(u)du. 

/ a a 

Prove the relevant form of the Fundamental Theorem of 
Calculus: 


<t>(b) - <p(a) = f <j>'(u)du. 

% 'a 

5. Let jc be a fixed complex number, g a differentiable 
complex valued function of a complex variable, and u a 
real variable. Then the function <j>(u) = g(ux) is a complex 
function of a real variable. Show that <j>'(u) = g'(ux) • x. 

6. In which part(s) of the proof of Theorem 10 was it 
necessary to use the fact that p is prime? 

7. It was implicitly assumed in the proof of Theorem 
10 that there are an infinite number of primes. Where was 
this assumed? Prove this fact. 

8. Prove that the number e is transcendental. (Hint: 
This is quite a bit easier than the corresponding proof for 
ir, but can be developed along the same outline. In 
particular, the assumption that e is algebraic leads 
immediately to an equation very similar to the equation 
e ?l + e ,! + • • • + e qm + k = 0, but where the exponents 
are integers and there may be integral coefficients in front 
of the terms. Working backwards, the reader can 
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construct an analogue of g, and from this analogues of / 
and F. The proof proceeds almost as before, but without 
the complication of symmetric functions. However, some 
coefficients need to be inserted in summing. This problem 
should help the reader gain a better understanding of the 
proof for 77.) 
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Section 2.1. Arithmetic of Polynomials 

In the last chapter, we saw how certain classical geometric 
construction problems depended for their solution on the 
nature of the roots of certain polynomials. We shall take 
another look at these problems later in this chapter. In the 
next chapter, we shall look at another famous problem, 
namely, the problem of trying to find analogues of the 
quadratic formula for the roots of polynomials of degree 
greater than 2. Since all these investigations deal with 
polynomials, we need first to learn more about the 
‘arithmetic’ of polynomials. 

Given a field F, we denote by F[x] the set of all 
polynomials over F in the variable x. As we shall see, this 
set has many properties very similar to those of the set Z 
of integers. First, F[x] is closed under the operations of 
addition, subtraction, and multiplication. But, like Z, it is 
not closed under division; the quotient of two poly- 
nomials is not necessarily a polynomial. In place of 
division, we have ‘division with remainder’. For example, 
in Z, we cannot simply divide 7 by 3; but we can say “7 
divided by 3 is 2 with remainder 1”; and we can write this, 
entirely within Z, as: 7 = 3 • 2 + 1; 2 is called the quotient 
and 1 is called the remainder. In F[x] we have similar 

59 
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‘division with remainder’, namely, the division algorithm 
for polynomials: 

Theorem 11. Let f and g be polynomials in F[x], 
g ^ 0. Then there exist unique polynomials q and r, also in 
F[jc], with either r identically 0 or the degree of r less than 
that of g, such that f = g- q + r. (q is called the quotient of f 
divided by g, and r is called the remainder.) 

Proof. First, let us note that q and r are simply what 
would emerge if we divided / by g according to the usual 
process of long division of polynomials, usually treated in 
high school. For a rigorous proof, let us write 

f( x )= a n x n + a n _ ijc" _, + •••+«(>, 

g(x) = b m x m + b m _ l x m ~'+ ■ ■ ■ + b 0 , b m + 0. 

Let m be arbitrary but fixed; we prove the result for all 
values of n > 0. For 0 < n < m, we must have q = 0 (that 
is, the 0 polynomial) and so r = f. Now we show the result 
by induction on n, for n > m. For n = m, q must be the 
constant a n /b m , since r cannot contribute to the leading 
term. This gives r = /- ( a n /b m )g , thereby determining it 
uniquely as an element of F[x]. Assuming now that the 
result is true up through polynomials of degree n — 1, we 
show it for n. In particular, since /- (a n / b m )x n ~ m g is of 
degree at most n — 1, there are unique polynomials q and 
r, r — 0 or deg r < deg g, such that 

f-{ajb m )x n - m g = g-q + r, 

whence 


f=g-[(a n /b m )x n - m + q] + r. 
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The bracketed term is an acceptable q. It is unique since 
its leading term must be (a n / b m )x"~ m , and from this, by 
the induction, the sum of all terms of lower degree, given 
by q , is uniquely determined. Since q is unique, so too 
is r. i 

It is useful to observe that if, in the context of Theorem 
11, F C F is a smaller field containing all the coefficients 
of / and g, then the proof implies that both q and r 
actually belong to F[x], a subset of F[x]. As a con- 
sequence, if / and g belong to both F,[x] and F 2 [x], where 
F, and F 2 are fields, the result of the division algorithm 
will be the same in both settings because the division 
actually takes place in F[x], where F = F, n F 2 . We refer 
to this fact by saying that the division of one polynomial 
by another is independent of the field over which we 
consider them. 

We say that a polynomial g divides a polynomial /, 
written g\f, if the remainder upon dividing / by g is the 0 
polynomial. For example, we write (x z - l)|(x 4 - 1) since 
(x 4 — 1) = (x 2 - l)(x 2 + 1). In this case, we call g a divisor 
of /, and the expression // g is used for the quotient of / 
divided by g. It is a proper divisor if its degree is less than 
that of / and a nontrivial divisor if its degree is greater than 
0. With these concepts of divisibility, we can now 
introduce the analogue in F[x] of prime numbers in Z. A 
nonconstant polynomial / £ F[x] is called irreducible in 
F[x] or irreducible over F if it has no nontrivial proper 
divisors in F[x]. To put it another way, / is irreducible 
over F if it cannot be written as the product of two lower 
degree polynomials over F. As an example, x 2 + 1 is 
irreducible over Q. But irreducibility depends closely on 
the field in question. The same polynomial is also 
irreducible over R; but over either Q(i) or C it is 
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reducible, factoring into (x - i)(x + /). In fact, over C 
every polynomial is completely reducible, that is, factorable 
into all linear factors. Given a polynomial over a field F, it 
is in general quite difficult to determine whether it is 
irreducible or not. For the field Q, a useful criterion will 
be given below in Theorem 12; and, fortunately, this is the 
case which holds the greatest importance for us. For the 
proof of this theorem, we shall make use of the following 
Lemma of Gauss: 


Lemma 12. If a polynomial with integral coefficients can 
be written as the product of two lower degree polynomials 
with rational coefficients, then, in fact, it can be written as 
the product of two lower degree polynomials with integral 
coefficients. 


Proof. Suppose that / = gh, where / has integral 
coefficients and g and h have rational coefficients. (Here 
and elsewhere, juxtaposition will be understood to mean 
multiplication of polynomials.) Without loss of generality, 
we may assume that the coefficients of / are relatively 
prime (i.e., they have no common factor other than ± 1), 
for otherwise we may divide out the common factors, 
apply the result, and then multiply through again. Let us 
write 


*(*)- + 


a. 


x ' 1 " 1 + • • 


m- 


K 

^-x m + ^-x m ~' + 


+ * 
bo ’ 


+ — 


m 


m — I 


where the a’s, b’ s, c’s, and d's are integers. Multiplying the 
first equation by the product B of all the b' s gives a 
polynomial on the right with integral coefficients, out of 
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which we can factor their greatest common divisor A. 
Thus 


(B/A)g(x) = A„x n + A„_iX n 1 + • • • + A 0 

is a polynomial with relatively prime integral coefficients. 
In like manner, we obtain another such polynomial 

(D/C)h(x)=C m x m + C m _ l x"-' + ■ ■ • +C 0 . 

And from / = gh we obtain BDf = (AC)\(B / A)g\ ■ 
[{D/C)h\. Since BD is the greatest common divisor of the 
coefficients of this polynomial, as we see from looking at 
the left side of the equation, and since AC is a common 
divisor of the coefficients, as we see from the right, then 
AC | BD. Defining E = BD / AC, an integer, we have 

Ef(x) = [A n x l ’ + A„_,x''- l + ■ ■ • +^ 0 ] 

x[C m x m + C m _,x"- l + • • • +C 0 ]. 

If we can show E — ±1, we shall be done; for we shall 
then have written / in the required form. If E # ± 1, it 
has a prime factor p. But p cannot divide all the A/s, for 
they are relatively prime, and so there exists a smallest 
integer i > 0 such that p does not divide A,. Similarly, 
there is a smallest integer j > 0 such that p does not divide 
Cj. Now we compare on both sides of the equation the 
coefficient of x' +J . From the left side, this coefficient is 
divisible by E and hence by p. From the right side, it is 
calculated to be 

i-l i+j 

2 A k C i+ j- k + 2 ^ k Cj+j~ k + AjCj. 

k — 0 k = I + 1 
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Since each A k in the first sum is divisible by p, so too is 
that sum. Since each C j+J _ k in the second sum is divisible 
by p, so too is that sum. But by the definition of i and j, 
the term A j Cj is not divisible by p. Hence the entire sum 
cannot be divisible by p , which is a contradiction. Hence 
E = ±1. i 

We are now able to derive a sufficient condition for 
irreducibility over Q of polynomials with integral 
coefficients. It is called the Eisenstein irreducibility 
criterion. 

Theorem 12. Let f(x) = a n x n + a n _ l x n ~' + • • • + 
a 0 , where the coefficients are all integers. If there exists a 
prime p such that: (i) p divides each of a 0 , a v . . . , a n _p, (ii) 
p does not divide a n ; and (iii) p 2 does not divide a 0 \ then f is 
irreducible over Q. 

Proof If / were reducible, it could be written as a 
product gh of polynomials over Q each with degree > 1. 
By Lemma 12, we may assume g and h have integral 
coefficients. If we write 

g(x) = b k x k + b k _ x x k ~ x + • • • +b 0 , 

h(x) = c m x m + c m _ iJC M ~ l + • • • +c 0 , 

then a n = b k c m , so that p cannot divide either b k or c m . 
Also, a 0 = b 0 c 0 , so p divides exactly one of b 0 and c 0 ; 
without loss of generality assume that p divides b 0 but that 
p does not divide c 0 . Since p does not divide b k , there 
must be a smallest subscript j > 1 such that p does not 
divide by Now, in the equation f — gh compare on both 
sides the coefficients of xK On the left it is a } , which is 
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divisible by p since j < k < n. On the right it is the sum 
b Q Cj + b x Cj _ , + • • • + bjC 0 , which is not divisible by p 
because every term except the last is divisible by p. This 
gives a contradiction. Thus / cannot be reducible. ■ 

It is important to realize that the above theorem only 
gives a sufficient condition for irreducibility over Q. For 
example, it enables us to conclude that x 5 - 2 and 
3x 5 + 7x 4 — 14x 2 + lx + 56 are irreducible over Q (take 
p — 2 and 7 respectively), but it does not give us any 
immediate information about x 3 - 3x - 1 or x 4 + x 3 + 
x 2 + x + 1, both of which are also irreducible over Q. To 
treat situations like these, sometimes a simple change of 
variables can be found to put a polynomial in a form to 
which the theorem is applicable. For example, making the 
change of variables x = u + 1 in x 3 - 3x - 1, we obtain 
m 3 + 3m 2 - 3, which is now seen to be irreducible (p = 3). 
Clearly, the property of irreducibility is not changed by 
this transformation. Use will be made of this technique in 
the problems as well as in later sections. 

So far, irreducible polynomials have simply been 
defined, and for the special case of Q[x] a sufficient 
condition for irreducibility has been given. Noting that 
the definition of an irreducible polynomial is quite similar 
to that of a prime number, we shall now proceed to see 
that the analogy extends much further. One fact about 
primes which we have used all along is that if a prime 
divides a product of two numbers, it must divide one of 
them. A similar statement will turn out to be true for 
irreducible polynomials (Theorem 14). Another useful fact 
about primes is that every integer can be written uniquely 
as a product of primes. Almost the same statement will 
turn out to be true for the factorization of a polynomial 
into irreducible factors (Theorem 15). In order to develop 
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these results, it is first necessary to introduce one more 
concept. 

Given two polynomials / and g in F[x], a greatest 
common divisor of f and g, denoted (/, g), is any poly- 
nomial of maximal degree and in F[x] which divides both 
/ and g. That is, there may be many polynomials 
(certainly including all the constant polynomials) in F[x] 
which divide both / and g; any one of maximal degree is a 
greatest common divisor (abbreviated g.c.d.) of / and g. 
Actually, it will turn out that all the g.c.d.’s of / and g are 
constant multiples of each other. When we write (/, g), we 
shall mean some or any g.c.d., as the context will make 
clear. Given / and g, how do we find (/, g)? Does it 
depend on the field F? (Surprisingly, the answer to this 
second question is “no”.) To begin on the first question, 
we first take a look at Z. 

Perhaps the reader has studied at some time the Eu- 
clidean algorithm, which is a procedure for finding the 
g.c.d. of two integers. It may be best to illustrate it by an 
example. To find, say, the g.c.d. of 882 and 270, first 
divide one by the other, as in the division algorithm: 

882 = 270 • 3 + 72. 

Now divide the most recent divisor (270) by the most 
recent remainder (72): 

270 = 72 • 3 + 54. 

Keep repeating this process until you get remainder 0: 

72 = 54 • 1 + 18 

54= 18-3 + 0. 

The divisor in the last step (18) is the g.c.d. Furthermore, 
by retracing the steps, we find that the g.c.d. can be 
written as a linear combination, with integral coefficients, 
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of the original numbers: 

18 = 72 - 54 

= 72 -[270 -72 -3] 

= [882 - 270 • 3] - [270 - (882 - 270 • 3)- 3] 

= 4 • 882 - 13 • 270. 

This is an important fact, frequently more useful than the 
procedure itself. The reader is asked in Problem 2 to 
prove the validity of the Euclidean algorithm for Z. We 
shall now state and prove the corresponding Euclidean 
algorithm for F[x], the actual procedure, which is not 
important for our purposes, being a by-product of the 
proof. 

Theorem 13. Let f,gE F[x]. Then there exist also s, 
t £ F[x] such that (/, g) = sf+ tg. 

Proof. Begin by dividing / by g, as in the division 
algorithm, to get 

/=£?i + 'Y 

Now divide the most recent divisor (g) by the most recent 
remainder (r,): 

g=r x -q 2 + r 2 . 

Keep repeating the process. Eventually, some r N+l = 0. 
(Make sure you see why.) Thus we have: 

'i = r i-<h + r 3 
r 2 = r 3 ■ q A + r A 

r N- 2 = r N- 1 ’ Rn + r N 
r N- 1 = r N ‘ Hn+ I + 0- 
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It is easy to see that r N can be written in the required 
form, for each r- can be so written. To prove this, we work 
by induction on For i — 1, we have r, = l -f + ( - q^) ■ g. 
For / = 2, we have r 2 = g - r, • q 2 = g - /• q 2 + q x q 2 - g 
= ( -q 2 ) •/+ (1 + q^q 2 ) ■ g. Assuming it true up to i - 1, 
we write r ( . = r,_ 2 - •</, from which the desired 

conclusion is obvious. It remains to show that r N is 
actually a g.c.d. From the representation r N = sf + tg, we 
see that any common divisor of / and g, and hence any 
greatest common divisor of / and g, say (/, g), divides r N , 
and so deg (/, g) < deg r N . But by the equations leading 
to r N , working backwards from the last one, we have r N 
successively dividing r N _ l( r N _ 2 , . . . , r v g, and /. Thus, r N 
is a common divisor of / and g, so that deg r N 
< deg (/, g ). Combining this with the previous result, we 
see that deg r N = deg (/, g), and thus r N is a g.c.d. of / and 
g. Further, since any g.c.d. has been shown to divide r N , it 
follows that all g.c.d.’s of / and g are constant multiples of 
r N and hence of each other. Thus, the representation given 
in the theorem is possible for every g.c.d. of / and g. ■ 

An immediate consequence of Theorem 13 is the fact 
that every common divisor of / and g divides (/, g), which 
property is frequently taken as the very definition of the 
g.c.d. The reader is asked to show in Problem 5 that the 
g.c.d. of two polynomials is independent of the field over 
which we consider them. 

Following the terminology of Z, we say that two 
polynomials / and g are relatively prime if they have no 
common nonconstant factors. In this case we can write 
(/, g) = 1, where the right side naturally refers to the 
constant polynomial with value 1. Now at last we can 
prove the two basic results mentioned earlier. 
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Theorem 14. Let /, g, h 6 F[x] and let f be irreducible, 
tf f I gh, then f \ g or f \ h. 

Proof. Suppose / does not divide g. Then (/, g) = 1 by 
the fact that / has no nonconstant factors other than itself 
or constant multiples of itself. By Theorem 1 3, there exist 
polynomials s and t G F[x] such that 

\ = sf+ tg, 

so that 

h = sfh + tgh , 

from which, since / | gh, we conclude that / 1 h. ■ 

Theorem 15. Let f G F[x], Then f can be written as a 
product of irreducible polynomials in F[x], and this 
decomposition is unique up to constant multiples of each 
nonconstant factor. 

Proof. Of course, the result is trivial for / a constant 
polynomial. When deg / > 1, it is obvious that / can be 
written as a product of nonconstant irreducible factors: 
just keep factoring it until each factor is irreducible. To 
show essential uniqueness, suppose that we can simul- 
taneously write / = p\p 2 •••/>„ = q x q 2 • ■ • q m , where the 
p's and q’s are nonconstant irreducible polynomials in 
F[x]. By Theorem 14, /?, must divide some q t , so by the 
latter’s irreducibility q t is a constant multiple of p v When 
both sides of the equation are divided by p x , the results 
must be equal, by the uniqueness assertion of the division 
algorithm. We now repeat the process for p 2 , and then p 3 , 
and so on. This also implies that n = m; for otherwise we 
would reach a point where one side has degree 0 and the 
other side does not. i 
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PROBLEMS 

1. Prove the division algorithm for Z: If m, n eZ, 
« ^ 0, then there exist unique integers q and r, with 
0 < r < |r|, such that m = nq + r. 

2. Prove the Euclidean algorithm for Z: If m, n e Z, 
n # 0, then there exist integers s and t such that the g.c.d. 
(m, n) = sm + tn. Moreover, the g.c.d. can be obtained in 
a finite number of steps by the method of repeated 
division of the most recent divisor by the most recent 
remainder, beginning with the division of m by n 
according to the division algorithm, as outlined in the text 
just prior to Theorem 13. 

3. Find the g.c.d. of 264 and 714. Can you take 
them in either order? 

4. Use the Euclidean algorithm for Q[x] to find a 
g.c.d. of /(x) = 2x 6 - 10x 5 + 2x 4 — 7x 3 - 15x 2 + 3x - 15 
and g(x) = x 5 - 4x 4 - 3x 3 - 9x 2 - 4x - 5. 

5. Show that the g.c.d. of two polynomials is 
independent of the field over which we consider them. 
That is, suppose F, and F 2 are fields and / and g are 
polynomials belonging to both F,[x] and F 2 [x]; show that 
they have the same g.c.d. in both F,[x] and F 2 [x]. (Note: 
Some divisibility properties, such as irreducibility, are 
closely related to the field in question, but the g.c.d. is 
seen here not to be.) 

6. Let /, g e F[x], / irreducible. Suppose there exists 
some a G C such that /(a) = g(a) = 0; that is, they have a 
common root in C. Show that / divides g. 

1. Let /, g, h e F[x], / ^ 0 (i.e., to recall our con- 
vention, / is not the 0 polynomial). If fg = fh, show that 
g = h. 

8. Show that (*" - l)/(x - 1) is a polynomial over 
Q for all positive integers n, and determine precisely the 
set of values of n for which it is irreducible. 
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9. Decompose x 10 - 1 into a product of irreducible 
factors in Q[x], 

10. Let / be a polynomial irreducible over F. Show 
that/ has no multiple roots. (Hint: Consider the g.c.d. of / 
and its derivative.) 

11. A polynomial with integral coefficients is said to 
be primitive if its coefficients are relatively prime (that is, 
there is no integer other than ± 1 which divides all the 
coefficients). If / is any polynomial with integral 
coefficients and if g is a primitive polynomial which 
divides /, show that the quotient f/g actually has integral 
coefficients. (As an application of this, observe that if a 
monic polynomial with integral coefficients divides 
another polynomial with integral coefficients, then the 
quotient has integral coefficients.) 

12. If / is a polynomial with integral coefficients and 
if g is a monic polynomial with integral coefficients, show 
that the quotient q and the remainder r, resulting from the 
division of /by g according to the division algorithm, both 
have integral coefficients. 

*13. Develop an algorithm for factoring any po- 
lynomial in Q[x] into irreducible factors. 


Section 2.2. Simple, Multiple, and Finite Extensions 

The matter of extensions of fields arose as early as Section 
1.1, in which it was shown how quadratic extensions are 
fundamental to the theory of constructibility. In this 
section we shall generalize the notion of field extensions 
and develop a coherent theory within which we shall then 
be able to go back and look again at some earlier results. 

If F is a field and if a v a 2 , . . . , a m are any m complex 
numbers, we define the extension of F by a,, a 2 , . . . , a m , 
denoted F(a,, a 2 , . . . , a m ), to be the smallest field 
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containing a,, a 2 , . . . ,a m as well as all the elements of F. 
(By “smallest”, of course, we mean the field which is the 
intersection of all fields containing all these elements.) In 
general, it is called a multiple extension of F, and in the 
particular case m = 1, a simple extension. In addition, if 
each of the a’s is algebraic over F, it is called an algebraic 
extension', otherwise it is a transcendental extension. 

It is natural to ask “What does F(a) look like?”. That is, 
when a is adjoined to F, what other elements must also be 
adjoined so that we still have a field? Questions of this 
nature were first pursued in Section 1.1, especially in the 
problems at the end of the section. For example, we found 
that Q(V2) = {a + by/2 \ a, b E Q), Q(VT, V3) = 
{a + by/2 + c'Jf + cbj 6 | a, b, c, d E Q}, and Q( 5 V^ ) = 
{a + b2 l/i + c2 2/s + d2? /5 + e2*' s \a, b, c, d, e E Q). In 
these cases, then, we were able to express each element of 
the extension as a linear combination of some specific 
finite collection of elements in the extension, the 
coefficients in the linear combination coming from the 
original field. This is strongly suggestive of some vector 
space structure, which is precisely the key to studying 
these extensions. 

First, to generalize even further the concept of field 
extensions, if E and F are fields such that EdF, then E is 
called an extension of F. In this case, E may be considered 
as a vector space over F, where vector addition is simply 
the usual field addition and scalar multiplication is the 
usual field multiplication. (The reader who is not 
accustomed to thinking of a field as a vector space over 
itself, or of the closely related notion here, may wish to 
work through Problem 1 before proceeding further.) The 
dimension of the vector space E over F is called the degree 
of the extension, and it is denoted [E : F]. 

For example, [Q(yT ) : Q] = 2, since a basis for Q(V2 ) 
over Q is given by the set {1,^2”}- Similarly, 
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[Q(V2,V3):Q] = 4, since this extension has as a basis 
{1, fl , V3", V6 }. That this set is a spanning set for the 
extension was shown in Problem 8 of Section 1.1; that it is 
a linearly independent set can be established by the type 
of argument used in Problem 7 of Section 1.1 and in 
Problem 2 of Section 1 .4. In the case of [Q( 5 V2" ) : Q] we 
know it is at most 5, since {1, 2 1/5 , 2 2/5 , 2 3/5 , 2 4/5 } were 
shown in Problem 6 of Section 1.1 to form a spanning set 
for the extension, which itself took a good deal of work; 
the question of linear independence was not treated. 
Recalling these labors, it is natural to ask: Is there some 
simple way to determine the degree of an extension? The 
affirmative answer to this question in certain important 
cases, of which we shall make frequent use, will be 
developed in the next few paragraphs. 

Let us begin with the case of a simple algebraic 
extension F(a) over F. Since a is algebraic over F, there 
exists a polynomial over F having a as a root; and hence 
there exists such a polynomial of smallest degree. Any 
such polynomial is called a minimal polynomial for a over 
F, and its degree is called the degree of a over F, denoted 
deg F a. For example, deg R / = 2, since x 2 + 1 is a minimal 
polynomial for / over R. Given a number a, it is in general 
quite difficult to find directly (or to prove you have 
found) a minimal polynomial for a. The key to the matter 
is the equivalence between the concept of “minimal 
polynomial” and that of “irreducible polynomial”: 

Theorem 16. Let f €E F[x] and suppose f(a) = 0. Then f 
is a minimal polynomial for a over F if and only if f is 
irreducible over F. 

Proof. If / is minimal, then it must certainly be 
irreducible, for if it could be written as a product gh of 
polynomials in F[x] of lower degree, one of them would 
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have to have a as a root, contrary to the assumption of 
minimality. 

On the other hand, if / is irreducible, we may apply the 
division algorithm to divide it by some minimal 
polynomial for a, say g. Thus f=g-q+r. Since 
f(a) - g(a) = 0, we have r(a) = 0, so by the minimality of 
g, r must be the 0 polynomial. But, since / is irreducible 
and deg g > 1, we must have deg g = deg /, so that q is a 
constant. Hence /, since it has the same degree as g, is also 
minimal. ■ 

Since we have an irreducibility criterion for poly- 
nomials in Q[x] with integral coefficients (Theorem 12), 
we are sometimes able to use this theorem to find the 
degree of an algebraic number over Q. For example, 
degg yjl — 5, since 5 fl is a root of the irreducible 
polynomial x 5 - 2. Also, deggCOS 20° = 3, since cos 20° is 
a root of the irreducible polynomial 8x 3 - 6x — 1 (see 
Section 1.3). 

With these ideas in mind, we now return to the problem 
of finding the degree of a simple algebraic extension: 

Theorem 17. If a is algebraic over F, [F(a) : F] = 
deg F a. If a is transcendental over F, [F(a) : F] = oo. 

Proof. For any a, we observe that 

F ( a ) = { g( a )/ h ( a ) I g’ h G F !>]< h ( a ) * °}< 

for this latter set is first of all a field (it satisfies the 
definition) and all its elements must be contained in any 
field containing a and all of F. 

Now, when a is algebraic over F we are able to simplify 
the representation of the elements in this set. In particular, 
letting deg F a = n, we first show that the elements 
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1, a, a 2 , ... , a" -1 are a spanning set for F(a) over F; or 
what is equivalent, every element g(a)/h(a) equals some 
r(a), where r G F[x] and deg r < n — 1 or r = 0. Let / be a 
minimal polynomial for a. If deg h > 1, then (/, h) = 1, 
for / cannot divide h (or else h(a ) = 0), and / is irreducible 
(by Theorem 16). By the Euclidean algorithm there exist 
s,/G F[x] such that 1 = sf + th. Substituting the value a 
for x and noting that f(a) = 0, we obtain t(a) = 1 / h(a). 
So now we can write g(a)/h(a) = g(a)t(a). Application 
of the division algorithm to divide the polynomial gt by / 
yields gt = fq + r, where either deg r < n — 1 or r = 0. 
Again substituting the value a for x and recalling that 
/(a) = 0, we have g(a)t(a) = r(a). 

But the elements 1, a, a 2 , . . . , a n ~ 1 are also linearly 
independent over F, or else a would satisfy a polynomial 
over F of degree < « — 1, contrary to the minimality of n. 
Hence these elements form a basis for the extension, 
completing the proof that when a is algebraic, [F(a) : F] 
= deg F a. 

When a is transcendental over F, we can find linearly 
independent sets with arbitrarily many elements, namely 
{1, a, a 2 , . . . , a"} for any n. This set is linearly 
independent since a nontrivial linear combination of its 
elements would give us a polynomial / over F such that 
/(a) = 0. Hence [F(a) : F] cannot be finite. ■ 

If E is an extension of F and [E : F] is finite, we call E a 
finite extension of F. In a certain sense, the case of finding 
the degree of a finite or of a multiple algebraic extension 
of F is covered by the previous theorem, for we show 
below that these kinds of extensions are themselves simple 
extensions! But while this result is of interest in itself and 
important for later work, it does not provide us with a 
very useful way of finding or representing the degree of 
these extensions. 
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Theorem 18. The following statements are equivalent: 

(i) E is finite extension of F. 

(ii) E is a multiple algebraic extension of F. 

(iii) E is a simple algebraic extension of F. 

Proof, (i => ii) We first observe that every element of 
E is algebraic over F. Let a G E and define n = [E : F], 
Then since the n 4- 1 vectors 1 , a, a 2 , . . . , a n must be 
linearly dependent, there is a nontrivial linear combina- 
tion of them equal to 0, with coefficients in F. Thus a is a 
root of some polynomial over F. Now, if we take a basis 
for E over F, say a,, a 2 , . . . , a n , it is easy to see that E is 
simply the multiple algebraic extension F(a,, a 2 , , a„). 

For E, in that it contains all the elements of F as well as 
a v a 2 , , a n , must contain the smallest field containing 

these elements, which by definition is F(a,, a 2 , ... , a„). 
Likewise, by closure under multiplication and addition, 
F(a„ a 2 , ... , a n ) must contain the set of all linear 
combinations of the a/s, with coefficients in F, which set 
is precisely E. 

(ii => iii) First we note that the multiple extension 
F(a,, a 2 , . . . , a n ) can be achieved by a sequence of simple 
extensions: F, = F(a,), F 2 = F,(a 2 ), . . . , F„ = F„ 

For F(a,, a 2 , . . . , a n ) and F„ must each contain all the 
elements of the other. Thus, if we can show that these 
extensions can be combined two at a time to form simple 
extensions, we can repeatedly apply this result to deduce 
that F(a„ a 2 , . . . , a n ) is a simple extension of F. 

It suffices then to consider the case n = 2, and, altering 
the notation, to show that for any two numbers b and c 
that are algebraic over F, there exists a number a such 
that F(h, c) = F(a). We shall actually find an acceptable a 
in the form of a simple linear combination of b and c, 
a = b + kc, where the coefficient k is a suitably chosen 
element of F. 
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Since b is algebraic over F, it is a root of some minimal, 
and hence irreducible, polynomial / over F, whose 
complete set of roots are b = b v b 2 , . . . , b q . These roots 
are all distinct since an irreducible polynomial cannot 
have multiple roots (Problem 10 of Section 2.1). Similarly, 
c is the root of some irreducible polynomial g, whose roots 
are c = c,, c 2 , . . . , c m , where these are also all distinct. 
Look at the q(m - 1) equations in x: b t + xcj = b, + xc t , 
for 1 < i < q, 2 < j < m. Since for these values of j, 
Cj / c,, each one of these equations has exactly one 
solution in C for x, and hence at most one solution in F 
for x. Since F has an infinite number of elements, we can 
choose a number k G F such that it is not any one of these 
solutions. Thus, h { + kcj ± b s + kc^ whenever 1 < i < q 
and 2 < j < m. 

Defining a = b + kc, we will now show that F(f>, c) 
= F(a). First, since a G F(6, c), we have F (a) c F (b, c). 
To show that V(b, c) c F(a), it suffices to show that both 
b and c G F(a). Indeed if we simply show that c G F(a) we 
shall be done, for then b = a - kc G F(a), since F(a) is a 
field. 

We will show that c G F(a) by showing that c is the root 
of a polynomial of degree 1 over F(a). Look at the 
polynomials g(x) and f(a - kx), both of which are 
certainly polynomials over F(a). (In fact, g G F[x] 
C (F(a))[x].) The number c is a root of each, as g(c) = 0 
and f(a - kc) = f(b) = 0. Since their g.c.d. is divisible by 
(x — c) in C[x], it has c as a root; the multiplicity of c as a 
root of the g.c.d. is exactly 1 since c is a root of 
multiplicity 1 of g. But, by the choice of k, the two 
polynomials g(x) and /(a - kx) have no other common 
roots, for the other roots of g are of the form c,, 
2 < j < m, and a - kcj # b, for any i. Thus, the g.c.d. of 
g(x) and f(a - kx) has degree 1, for all of its roots are 
common roots of these polynomials. But the g.c.d. is a 
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polynomial over F(a), the field of coefficients of g(x) and 
f(a - kx). Thus c, a root of the g.c.d., is a root of a 
polynomial of degree 1 over F(a). Hence c e F(a). 

(iii => i) This is immediate from Theorem 17. ■ 

Much more important in the actual calculation of the 
degree of field extensions is the following theorem: 

Theorem 19. Suppose E, K, and F are fields such that 
EdKdF. Then [E : F] = [E : K][K : F], where if either 
factor on the right is oo, the product is taken to be oo. 

Proof. Suppose e v e 2 , . . . , e n are linearly independent 
vectors of E over K, and k v k 2 , . . . , k m are linearly 
independent vectors of K over F. Then the nm vectors e,kj, 
1 < i < n, 1 < j < m, are linearly independent vectors of 
E over F. For, if we have a linear combination of them 
equal to 0, 2"- i 27 - i( a ij k j) e i ~ °> % e F > then b y the 
linear independence of the e’s over K, it follows that for 
each i, 27- \ a jikj = 0. But now the linear independence of 
the kfs over F implies that each a Xj = 0. This shows linear 
independence, since there can be no nontrivial linear 
combination equal to 0. 

Thus, if either [E : K] or {K:F] is oo, one of n or m 
may be taken arbitrarily large, implying that [E : F] = oo. 
If they are both finite, we can take the e,’s and the kfs in 
the previous paragraph as bases of their respective vector 
spaces. Thus it simply remains to show that the set of nm 
vectors e t kj spans E over F. Let t> be an arbitrary element 
of E. Since with n = [E : K], the vectors, e t , e 2 , . . . , e„ 
form a basis for E over K, there exist elements 
b x , b 2 , ... ,b„ of K such that v = 2"_ Ac,- But each 
being in K, can be expressed in the form h, = 27- i a ij^p 
where each a y E F. Combining this with the previous 
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equation, v = 2". i 27 - \ a ij e ,kp which represents v as a 
linear combination over F of the e t k 's. i 

This relatively simple theorem will be applied often in 
the development of subsequent results. Let us observe 
here that by its repeated application to a finite sequence 
of field extensions F N d F n _ i D • • • D F, D F 0 , we may 
conclude that 

[F* : F 0 ] = [F„ : F„_,][F„_, : F„_ 2 ] • • • [F, : F 0 ], 

PROBLEMS 

1. If E is an extension of F, verify that E may be 
considered as a vector space over F. (Recall the axioms 
for a vector space: A set V is said to be a vector space over 
a field F if it has the following properties: (a) there is a 
rule for adding vectors, always resulting in an element of 
V; (b) this addition operation is associative and com- 
mutative; (c) there is a vector called 0, such that v + 0 = v 
for every v, (d) for every v there is a vector called — v, 
such that v + ( - v) = 0; (e) there is a rule, called scalar 
multiplication, for multiplying a scalar times a vector, 
always resulting in an element of V; (f) for a, b E F and 
u, v E V, ( ab)v = a(bv), a{u + v) = au + av, and (a + b)v 
= av + bv, (g) for every v G. V, \ v = v, where 1 is simply 
the number 1 in F.) 

2. Show that if E is a quadratic extension of F, then 
[E : F] = 2. 

3. Let E be an extension of F. Show that [E : F] = 1 
if and only if E = F. 

4. If E is an extension of F such that every element 
of E is algebraic over F, we say that E is algebraic over F. 
Is every algebraic extension a finite extension? 
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5. If E is an algebraic extension of K and K is an 
algebraic extension of F, show that E is an algebraic 
extension of F. (See the previous problem for the 
definition of an algebraic extension.) 

6. Prove Theorem 8 by using the ideas of this 
section. (Hint: Observe that if a and b are algebraic over 
F, then F(a, b) is a finite extension of F.) 

7. Express Q(/2" , VJ ) as a simple extension of Q. 
Find the degree of this extension over Q. 

8. Let / be a polynomial of degree n over F. Show 
that there exists an extension E of F in which / is 
completely reducible (i.e., can be factored into linear 
factors) and such that [E : F] < n ! . 

9. Let a be transcendental over F. Show that the set 
{a k | k e Z} is linearly independent in F(a) over F. Is it a 
basis for F(a) over F? 

10. Let / e R[x] with deg/ >3. Show that / is 
reducible. 

11. Use the result of the previous problem to show 
that for / e R[jc], a is a root if and only if its complex 
conjugate a is a root. (The complex conjugate of b + ci is 
b — ci, where b, c £ R.) 

*12. If E is a finite extension of F, show that there are 
only a finite number of intermediate fields K, E D K D F. 
(This may be surprising.) 

13. Suppose that E = F(r), where r is a root of an 
irreducible polynomial / over F. If r is another root of / 
and if r £ E, show that E = F(r). 


Section 2.3. Geometric Constructions Revisited 

A considerable amount of material was covered in the two 
previous sections, and so it is nice to be able to relax at 
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this point and look back at some old, familiar 
construction problems, which have very simple solutions 
in terms of the recent material. 

Recall that the duplication of the cube involved the 
construction of a root of the polynomial x 3 — 2, which is 
irreducible over Q. Similarly, the trisection of an angle of 
60° involved the construction of a root of the polynomial 
x 3 — 3.x - 1, which is also irreducible over Q (Section 
2.1). In each case it was required to construct a number 
whose degree over Q was 3. But by using the theory of 
field extensions, it is very simple to show that the only 
numbers that can possibly be constructed must have some 
power of 2 as their degree over Q. That is, we have: 

Theorem 20. If a is construclible, then deg Q a must be 
a power of 2. 

Proof. If a is constructive, then by Theorem 1 there is 
a sequence of fields Q = F 0 c F, c • • • C F^ such that 
a e F^ and for each j, F J+ , is a quadratic extension of F y . 
Thus, for each j, [F / + , : F y ] = 2, by Problem 2 of Section 
2.2 (of course, it’s practically obvious). By the repeated 
application of Theorem 19, [F w : Q] = 2 N . Since a G F^, 
Q («) C F„, so that by Theorem 19 again, 

[F„:Q]=[F „:Q(«)][Q(a):Q]. 

Since the left side is 2 N , neither factor on the right can 
have any odd prime factors. Thus, [Q(a) : Q] must be a 
power of 2, and by Theorem 17 this equals deg Q a. ■ 

Let us even state separately, as a corollary, the 
following immediate reformulation: 

Corollary 20. It is not possible to construct any 
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number which is the root of an irreducible polynomial over 
Q that has degree other than a power of 2. 

It follows immediately from this result that it is not 
possible to duplicate the cube, as that would require the 
constructibility of a root of an irreducible polynomial over 
Q of degree 3. Similarly, it is not possible to trisect 60°, 
which shows that there can be no way (within the usual 
rules) to trisect an arbitrary angle. Although the original 
solutions to these problems were rather simple, one senses 
that the present method is more general and powerful. In 
fact, as the reader will see in Problem 1, it is now very 
easy to solve the problem of dividing an angle into five 
equal parts, which was quite difficult under the approach 
used in Chapter 1 for these other problems. (Cf. Problem 
3 of Section 1.3.) 

With these methods now available, we are finally able 
to tackle another famous problem: For what values of n Is 
it possible to construct a regular polygon of n sides? Such a 
polygon is called a regular n-gon. 
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First we observe that the constructibility of a regular 
n-gon is equivalent to the constructibility of an angle of 
360 /n degrees. For, if 360 jn degrees is constructible, we 
can inscribe a regular n-gon in the unit circle by 
successively marking off central angles of this amount, as 
illustrated in Figure 6. Conversely, if a regular n-gon is 
constructible, each of its exterior angles is constructible, 
and since these add up to 360°, each is 360°/n (see 
Figure 7). Alternatively, one could construct the center of 
the circle in which the n-gon is inscribed by taking the 
intersection of the perpendicular bisectors of any two 
nonparallel sides. One then draws the circle, and the arcs 
between any two successive vertices each subtend a 
central angle of 360 °/n. 

It would be possible at this point to find an equation 
relating cos(360°/n) and cos(360°), which equals 1 and 
hence is constructible, and to try to determine whether 
cos(360°/ n) is constructible by using this equation. 
However, it is much simpler to effect a further 
reformulation of the problem first. The constructibility of 
an angle of 360 °/n is obviously equivalent to the 
constructibility of each of the points (cos(A: 360 ° / ri). 


\ 



Figure 7 
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sin(/c 360 °/h)) in R 2 . In the next section we shall see that 
these points can be represented by complex numbers. 
Further, we shall see that the whole theory of 
constructibility can be extended rather simply to the 
complex numbers. Within this framework, we shall then 
be able to solve this problem. 


PROBLEMS 

1. Use the techniques of this section to show that it is 
not possible to divide an arbitrary angle into five equal 
parts by the usual rules of construction. 

*2. Let S = {6 | 9 is a trisectible angle}. Prove that S 
is countable. (Keep in mind the distinction drawn at the 
end of Section 1.3.) 


Section 2.4. Roots of Complex Numbers 

Every complex number a j* 0 has n distinct nth roots. 
For the polynomial x n - a has exactly n roots (Theorem 
7), and none has multiplicity greater than one, for then it 
would also be a root of the derivative nx n ~\ which is 
impossible since a ^ 0. The nth roots of the number 1 are 
called the nth roots of unity, and they are particularly 
important for our work. 

As an example, the cube roots of unity are the roots of 
the polynomial x 3 — 1. Since 1 is obviously one root, we 
can divide by x - 1 to get x 2 + x + 1 whose roots 
(— l±iV3)/2 are the other cube roots of unity. The 
fourth roots of unity are the roots of x 4 — 1 , which can be 
factored into (x 2 + l)(x 2 - 1) from which we obtain the 
four values, +i, —i, +1, -1. 
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A convenient representation of the nth roots of unity is 
provided by the complex exponential function, introduced 
in Section 1 .7. For by direct computation we find that the 
nth roots of unity are si mply the numbers where 

1 < k < n and where i =V~ 1 . To see this, we calculate 


irik/n^ n = £ 2irik _ ( £ 2m = J* _ 


and note that e 2mk/n gives n distinct numbers for 
1 < k < n. In terms of the trigonometric functions, 
e 2 mk/n _ cos (2nk / n) + i sin( 27 rk/n). 

For a fixed n, some of the nth roots of unity may have 
the property that they are also mth roots of unity for some 
m < n. For example, although - 1 is a fourth root of unity 
(n = 4), it is even a square root of unity (m = 2). Those 
nth roots of unity which are not mth roots of unity for any 
m < n are called primitive nth roots of unity. There is 
always at least one primitive nth root of unity, as can be 
seen by taking k - 1 in the complex exponential form. If 
u> is any primitive nth root of unity, then the numbers 
w, « 2 , <o 3 , . . . , to" = 1 are the entire list of nth roots of 
unity. To see this, observe first that they are all roots of 
unity, for (« 7 )" = u> J " = (w") v = V = 1. And they are 
distinct since if u 7 = u k with 1 < k <j < n, we would 
have w 7_ * = 1, thus contradicting the primitive nature of 
«, since 1 < j — k < n — 1. 

The Euler <j>- function , defined by <j>(n) = the number of 
integers between 1 and n inclusive which are relatively 
prime to n, provides a means of counting the number of 
primitive nth roots of unity, for we have: 


Theorem 21. If u is a primitive nth root of unity, then 
<o k is also a primitive nth root of unity if and only if k and n 
are relatively prime. The number of primitive nth roots of 
unity is (j>(n). 
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Proof. The second sentence will clearly follow im- 
mediately from the first since all the roots of unity are 
represented by the values of k ranging from 1 to n. Now, 
suppose k and n are relatively prime. If there were some 
m, 1 < m < n, such that (w k ) m = 1, then km must be some 
integral multiple of n; otherwise, if km = nq + r, as in the 
division algorithm for Z, (u k ) m = co* m = = u r = 1, 

with 1 < r < n — 1, contrary to the primitive character of 
to. Thus n | km; but since n and k have no common 
factors, n \ m, which is impossible since n > m. Con- 
versely, if k and n have a common prime factor p, then w* 
is an mth root of unity for m = n/p<n. For, 
( u k ) n/p = (u n ) k/p = \ k/p = 1, since k/p is an integer. ■ 


Let us also observe that if r is any nth root of the 
complex number a # 0 and if w is a primitive nth root of 
unity, then the n distinct nth roots of a are given by 
ru, ru 2 , rto 3 , . . . , ru" - r. 

There is a useful geometric interpretation of complex 
numbers and their arithmetical operations. Let z = x + iy, 
where x and y are real. We represent z in the Cartesian 
plane by the point ( x,y ). Similarly to polar coordinates 
we can write 


y 


= x 2 + y 2 x + i 

y x 2 + y 2 ^x 2 + y 2 

= |z|[cos 9 + i sin 9] 


= Izle* 


since the point (x/^Jx 2 + y 2 ^/yjx 2 + y 2 ), being on the 
unit circle, can be represented by (cos 9, sin 9) for some 9. 
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The a ngle 6 is called an argument of z; the number 
|z| = \jx 2 + y 1 is called the modulus of z. The situation is 
depicted geometrically in Figure 8. The representation of 
a complex number z in the form \z\e' a is called its polar 
representation. It gives a very convenient way to picture 
the multiplication of complex numbers. Writing z, 
= |z,|e' #1 and z 2 = \z 1 \e‘ e ' t , the laws of exponents imply 
that z,z 2 = |z,| |z 2 |e' (S|+Sj) , so that the modulus of a 
product is the product of the moduli and the argument of 
a product is the sum of the arguments. This is shown 
graphically in Figure 9. Thus, an nth root of a complex 
number z may be obtained by taking the real positive root 
of its modulus and dividing its argument by n. That is, an 
nth root of z = \z\e‘ e is given by r = 1J\z\ e' e/n . It is 
important that the reader see this by both analytical and 



Figure 8 
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geometrical reasoning. Analytically, 

r" = ("\/j7j" e w/n ) = \z\e iB = z. 

Geometrically, the situation is suggested in Figure 10. 

From the graphical representation of complex numbers, 
we find that the «th roots of unity are represented by the 
points (cos(2wi/c/n), sin(2irik / n)), 1 < k < n, which are 
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simply n equally spaced points around the unit circle, 
including the point (1,0). If we take u = e 2m ^ n , these 
points are marked and labelled as powers of w in Figure 
11. Note that these are precisely the points that must be 
constructed in order to construct the regular n-gon! 

With respect to the n nth roots of an arbitrary z, they 
are simply n equally spaced points on a circle of radius 



Figure 12 
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Vfzf ; but, of course, their orientation will in general be 
different from that of the roots of unity. For example the 
five fifth roots of 2/ are graphed in Figure 12. 


PROBLEMS 

1. Determine the cube roots of unity by a geo- 
metrical analysis. 

2. Evaluate <j>(k) for k = 1, 2, . . . , 15. 

3. Show that <j>(p) = p - 1 for p a prime. 

4. Determine <f>(p n ) for p a prime and n a positive 
integer. 

5. Would it make sense to discuss “primitive” nth 
roots of 2? 

6. How many primitive 20th roots of unity are 
there? 

7. Describe the location of the seventh roots of 
1 — / in the complex plane. 

8. Show that if a is an nth root of unity, then a is a 
primitive mth root of unity for some m such that m \ n. 

9. Show that n <p(d) = n, where the sum is over 
all positive integers d which divide n. Verify this result by 
direct calculation for the case n = 12. 

10. Determine the degree over Q of the primitive nth 
roots of unity for n = p and n = p 2 , where p is a prime. 

11. If k and n are relatively prime positive integers, 
show that 2,/V* = 0, where the r’s are the n distinct nth 
roots of unity. 

*12. Determine the degree over Q of each of the 
primitive 20th roots of unity. (This is not the same 
question as in Problem 6.) 
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*13. Show that the degree over Q of each of the 
primitive nth roots of unity when n- p m , a power of a 
prime, is exactly <p(n). 


Section 2.5. Constructibility of Regular Polygons I 

In this section it will be possible to make substantial 
progress on the question formulated in Section 2.3, 
namely: For what values of n is it possible to construct a 
regular n-gon? It was seen in Section 2.4 that the 
constructibility of the regular n-gon is equivalent to the 
constructibility of the points in the Cartesian plane 
corresponding to the nth roots of unity. Since these are 
complex numbers, it will be convenient to extend the 
definition of constructibility directly to complex numbers. 

For any number a E C, we say that a is constructible if 
both its real and imaginary parts are constructible. This is 
obviously equivalent to the constructibility of the point 
representing a in the Cartesian plane. The theory of 
constructibility extends immediately to complex numbers, 
as shown in Lemmas 22a and 22b and Theorems 22 and 
23. 

Lemma 22a. If a, and a 2 are constructible, then so too 
are a, + a 2 , a, - a 2 , a,a 2 , and, when a 2 # 0, a x /a 2 . 

Proof. Let b t and c, denote, respectively, the real and 
imaginary parts of each a,. From the computations below, 
we see that the real and imaginary parts of the results of 
combining a x and a 2 by each operation are each 
constructible by Lemma la, which is the analogue of the 
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above for real numbers: 

a, + a i = (P\ + b 2 ) + * (c, 4- c 2 ) 

a x -a 2 = {b x - b 2 ) + i(c, - c 2 ) 

a x a 2 = (b x + ic x )(b 2 + ic 2 ) 

= (b x b 2 - c,c 2 ) + i(b x c 2 + b 2 c x ) 

a\/a 2 = {b x + ic x )/{b 2 + ic 2 ) 

= [(/>! + ic x )(b 2 - ic 2 )]/[(b 2 + ic 2 )(b 2 - ic 2 )] 

= [(b x b 2 + c x c 2 )/(bl+ C j)] 

+ i[(b 2 c x - b x c 2 )/(bl + cl)]. ■ 

Lemma 22b. If a is constructible and if k 2 = a, then k is 
constructible. 

Proof. It is obvious that a complex number z = \z\e' 9 
is constructible if and only if the real number \z\ and the 
angle 9 are constructible. Thus the number \a\ and an 
angle 0^an argument of a, are constructible. By Lemma 
lb, VH is constructible, and since an angle can be 
bisected, the angle 6/2 is constructible. But k 
- ± V| a | e‘ e/1 , so k is constructible. ■ 

The proof of the preceding lemma had a strong 
geometric flavor, and one should not hesitate to use such 
reasoning when it serves the purpose. The same kind of 
reasoning could be used to give simple alternate proofs of 
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the multiplication and division parts of Lemma 22a. (See 
Problem 1.) Alternatively, Lemma 22b can be proved 
without reference to the polar form of a. (See Problem 2.) 

In the statement of Lemma 22b, reference was made to 
a number k such that k 2 = a\ but we did not write Vo" . 
The reason is this. Whereas for positive real numbers a, 
•{a is well defined as the unique positive square root of a , 
when a is complex, we have no such natural way to 
denote a particular square root of a. With this caution in 
mind, we shall agree to use the symbol {a or "Va as an 
abbreviation for an arbitrary square or «th root of a in 
cases where the statement we make is true for all such 
roots. Thus, it would now be acceptable to write Lemma 
22b as: “If a is constructible, then ija is constructive.” 
To be consistent with this new convention, if a is a 
positive real number, we shall use the symbol + fa to 
denote its positive square root. 

By a quadratic extension of an arbitrary field F we mean 
a simple extension F(fa ), where a G F and fa F; since 
the square roots of a are negatives of each other, the 
extension by either is the same field. Theorem 1 provided 
a necessary and sufficient condition for constructibility of 
real numbers, and it extends quite easily to the complex 
case: 

Theorem 22. The following two statements are equiva- 
lent: 

(i) The number a is constructible. 

(ii) There exists a finite sequence of fields Q = F 0 c F, 
C • • • C F N , such that a E.F N and for each j, 0 < j 
< N — 1, Fy + , is a quadratic extension of Fj. 

Proof. (i=*>ii) Suppose a = b + ic, b,cG R. By 
Theorem 1 and the constructibility of b and c, there are 
two sequences of quadratic extensions: Q = G 0 c G, c 
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• • • cG t with b G G l and Q = H 0 CH,C C 
H m with cGH m . For each y, H ;+ , = ). If we 

extend G L , one step at a time, by y . . . , , , 

and i, omitting in the sequence of fields any repetitions, 
we have the desired sequence of quadratic extensions, for 
the final field contains b, c, and /, and hence a. 

(ii => i) By induction on IV. If N = 0, a is rational and 
hence constructible. Now we assume the theorem is true 
for N — R and we prove it for N = R + 1. If a G F^ +1 , 
then a = a R + b R yfk^ , where a R , b R , and k R all belong to 
¥ r , and hence are constructible. But then, by Lemmas 22a 
and 22b, a is constructible. ■ 

By exactly the same reasoning as before, we use 
Theorem 22 to obtain the analogues for complex numbers 
of Theorem 20 and its Corollary: 

Theorem 23. If a is constructible, then deg Q a must be 
a power of 2. Thus it is not possible to construct any number 
which is the root of an irreducible polynomial over Q that 
has degree other than a power of 2. 

Now we are ready to tackle the question stated at the 
beginning of this section. The constructibility of the 
regular «-gon is equivalent to the constructibility of the 
nth roots of unity, and hence, in view of Lemma 22a, to 
the constructibility of any primitive nth root of unity 
(since any nth root of unity is a product of a primitive 
root with itself an appropriate number of times). Our 
approach will be in two parts. First we will use Theorem 
23 to narrow down the candidates for values of n for 
which the regular n-gon might be constructible. Then we 
shall show directly that for each one of these candidates 
the construction is actually possible. 
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One way to proceed with the first task would be to 
compute the degree of a primitive nth root of unity, and 
then to rule out all those values of n for which this degree 
is not a power of 2. It turns out that the 4>(n) primitive nth 
roots of unity are precisely the set of roots of a single 
irreducible polynomial over Q, with integral coefficients, 
called the nth cyclotomic polynomial, which thus has degree 
<#>(«). But to show this is quite difficult (see Problem 5, 
Section 2.7), and so we adopt a simpler course here that 
does not depend on this result. In particular, we simplify 
the problem by looking at the divisors of n. 

Lemma 24a. If the regular n-gon is constructible, then 
so too is the regular m-gon for any m > 3 such that m \ n. 

Proof. If n = mk, just connect every kth vertex of the 
n-gon, beginning at any one. a 

Lemma 24b. If the regular n-gon is constructible and if 
the odd prime p divides n, then p is of the form 2 (2t) + 1 . 

Proof. By the hypothesis and Lemma 24a, the regular 
/?-gon is constructible; and so a primitive pth root of unity 
is constructible. But the — 1 primitive pth roots of unity 
are the roots of the polynomial (x p - l)/(x - 1) = 
x p ~ x + x p ~ 2 + • • • +1, which was shown in Problem 8 
of Section 2.1 to be irreducible. By Theorem 23 then, 
p - 1 must be a power of 2; that is, p = 2 m + 1 for some 
positive integer m. However, the only time a number of 
this form can possibly be prime is when m = 2 k for some 
k. To see this, note that for m = 1 we can take k = 0, and 
for m > 1, if m has an odd factor q, so that m = qr, then 

2 m + 1 = [(2 r ) ? + 1] = [2 r + 1 ][(2 r ) 9 ~ 1 - (2 r y~ 2 + 

(2 r )’ -3 - • • • — 2 r + 1], which is not prime. ■ 
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Primes of the form 2 (2 *> + 1 are called Fermat primes-, 
the only ones known to date are 3, 5, 17, 257, and 65537. 
Thus, if the regular n-gon is constructible, all the odd 
prime factors of n must be Fermat primes. Even further, 
no odd prime can divide n more than once, for we have: 

Lemma 24c. If the regular n-gon is constructible and if 
p is an odd prime, then p 2 does not divide n. 

Proof. From Problem 4 of Section 2.4. we know that 
•Kp 2 ) = pip ~ !)• ^ P 2 1 n > the pip ~ 1) primitive p 2 roots 
of unity would have to be constructible. But these each 
have degree pip - 1) over Q, as was shown in Problem 10 
of Section 2.4. Since their degree has the odd factor p, 
they cannot be constructible. ■ 

Putting the previous two lemmas together, we im- 
mediately obtain the following necessary condition on n 
for the constructibility of the regular n-gon: 

Theorem 24. For the regular n-gon to be constructible, 
n must be of the form 2 k p l p 1 - • • p m , where the p,'s are 
distinct Fermat primes, (m may equal 0, in which case n has 
no odd factors.) 

As will be shown in Section 2.7, the converse of this 
theorem is also true. That is, for any n of the given form, 
it is indeed possible to construct the regular n-gon. But the 
proof of this is rather intricate and depends on certain 
number-theoretic results which we shall develop in the 
next section. 

For the time being, we might observe that Theorem 24 
implies that it is not possible to construct regular polygons 
with, for example, 7, 11, or 90 sides. For another look at 
the trisection problem, observe that the trisection of 60° 
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would necessitate the constructibility of a regular 18-gon; 
and since 18 = 2 • 3 2 , this is impossible. 


PROBLEMS 

1. Give a proof of the multiplication and division 
parts of Lemma 22a using the polar representation of a, 
and a 2 . 

2. Give a proof of Lemma 22b which does not use 
the polar representation of a. 

3. Give an analytic proof of Lemma 24a. 

4. Prove that the regular pentagon is constructible. 

5. Is it possible to divide an angle of 60° into five 
equal parts? 


Section 2.6. Congruences 

In this section, we shall develop some basic notions about 
congruences, which are a very important tool in number 
theory and in algebra. All the numbers referred to will be 
assumed to be integers. 

If n is a positive integer, we say that a is congruent to b 
modulo n, written a = b (mod n), if n \ (a - b). This is 
easily seen to be equivalent to a and b having the same 
remainder when divided by n. For example, 21 = 6 (mod 
5) since 5 | (21 — 6); equivalently, each has remainder 1 
when divided by 5. Similarly, -17 = 3 (mod 10) since 
10 | ( - 17 - 3); equivalently, each has remainder 3 when 
divided by 10. (Review the division algorithm for Z, if 
necessary, in Problem 1 of Section 2.1.) 

A statement of the form a = b (mod n) is called a 
congruence, and we shall now proceed to show that 
congruences behave very much like equations. In 
particular, if a = b (mod n) and c = d (mod ri), then both 
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a + c = b + d (mod ri) and ac = bd (mod n). These follow 
immediately from the definition, noting for the second 
case that ac — bd = ac - ad + ad - bd = a(c — d) + 
d(a - h). Thus we can add (and subtract) and multiply 
congruences. Since we generally can’t divide in Z, we do 
not discuss the general division of one congruence by 
another. However, we do have the following cancellation 
property. Suppose that a and n are relatively prime, which 
we can abbreviate by writing the g.c.d. (a, n) = 1 . Then 
the congruence ab = ac (mod n) implies that b = c (mod 
n), for if n | a(b - c) but n has no factors in common with 
a, then n | (b - c). This is completely analogous to the 
division of an ordinary equation by a nonzero factor of 
both sides. Lastly, congruences have the three properties 
of an equivalence relation : (i) for every a, a = a (mod «); 
(ii) if a = b (mod n), then b = a (mod «); and (iii) if a = b 
(mod n) and b = c (mod n), then a = c (mod n). These 
also follow at once from the definition. 

The main value of congruences is the simple 
representation they afford for certain kinds of properties 
and calculations involving integers. For example, to show 
that the product of two odd numbers a and b is odd, we 
could represent a as (2n + 1) and b as (2m + 1) and 
calculate ab - (In + l)(2m + 1) = 4 nm + 2 n + 2m + 1 
= 2(2 nm + n + m) + 1, from which it follows that ab is 
odd. The same calculation, written in the language of 
congruences, might go like this: a = 1 (mod 2), b = 1 
(mod 2), and hence ab = 1 (mod 2). Much of the 
development of mathematical thought is due to the 
invention of simple notation that can be effectively 
manipulated; congruences are an excellent example of 
this. 

It is not the purpose here to make a detailed study of 
congruences, which are ordinarily treated in number 
theory, but rather to develop a particular tool, Theorem 
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27, which will enable us to return effectively to the 
problem of the construction of the regular «-gon. The 
auxiliary results developed here will also be useful later 
on, and the reader should especially note some very close 
similarities between the statements and proofs here and 
those of earlier sections. 

Frequent use will be made of the Euler ^-function; 
recall that $(n ) represents the number of integers between 
1 and n (inclusive) which are relatively prime to n. To 
begin, we have Fermat’s Theorem: 

Theorem 25. If a and n are relatively prime, then 

a H») = j n y 

Proof. Let r v r 2 , ... , r^ n) be the (distinct) numbers 
between 1 and n which are relatively prime to n. If j is 
any integer at all which is relatively prime to n, then for 
some unique /, s = r j (mod ri), for by the division 
algorithm we can write s = nq + r, and r cannot have a 
factor in common with n or else j would too. Thus 
r = some r ( , and so 5 = r, (mod «). In particular, each 
number of the form ar t , since it has no factors in common 
with n, must satisfy ar t = r } (mod ri) for some j. Further, if 
i ^ k, then ar t ^ ar k (mod n), for the contrary would 
imply that r, = r k (mod n), which is impossible since r, and 
r k are distinct numbers between 1 and n. Thus, 





Ft ar, 

=n o 

(mod ri). 

i = 1 

J - 1 





Mn) II r, 

-n, 

(mod ri). 

i = 1 

J - > 



Since and n have no common factors, we conclude 

that a* (n) = 1 (mod n). ■ 
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For;? a prime, we know that <j>(p) = p - 1 and so in this 
important case we have the following immediate 
corollary: 

Corollary 25. If p is prime and if a is not divisible by 
p, then a p ~ 1 = 1 (mod p). For any a, a p = a (mod p). 

In the context of the first sentence of this corollary, if 
further a m ^ 1 (mod p) for each m such that 
1 < m < p — 2, then we say that a is a primitive root 
modulo p. (Watch the developing analogy, especially in the 
proofs, with primitive roots of unity.) It is not at all 
obvious that there will necessarily even exist a primitive 
root modulo p. In our proof of this fact, we shall need the 
following theorem: 

Theorem 26. If f(x) = a n x n + a n _ ] x "~ 1 + • • • + a 0 
is a polynomial with integral coefficients and if p is a prime 
which does not divide a n , then the congruence f(x) = 0 (mod 
p) has at most n incongruent integral solutions. 

Proof. By induction on n. For n = 0, there are no 
solutions, since then f(x) = a 0 ^ 0 (mod p). Assuming 
that the theorem holds for all polynomials of degree n — 1, 
we proceed to show it for each polynomial / of degree n. 
If the congruence f(x) = 0 (mod p) has no roots, then of 
course we are done. In the other case, we shall make use 
of the factorization, for any x and r, 

/(*) -/('•)= i «*(**-'*) 

k = 1 

n 

= (x - r) 2 a k {x k ~' + x k ~ 2 r + x k ~ 3 r 2 
k = I 

+ • • • +r k ~ x ) 


= (x- r)g(x), 
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where g is a polynomial of degree n - 1 with leading 
coefficient a n , which is not divisible by p. Now if r is an 
integer, g(x) has integral coefficients. If in addition r is a 
root of the congruence f(x) = 0 (mod p), we may rewrite 
the problem as 

0 = /(■*)=(* - »•)«(■*) (mod p). 

If x is any root incongruent to r, the cancellation property 
implies that 0 = g(x) (mod p), and so, by the inductive 
hypothesis, there are at most n - 1 roots incongruent to r. 
Thus, there are at most n incongruent solutions to the 
original congruence. i 

Now we are able to settle the matter at hand, namely, 
the existence of primitive roots modulo p, for p a prime. 

Theorem 27. If p is a prime, then there exist exactly 
<f>(p ~ 1) incongruent primitive roots modulo p. 

Proof. Consider the p - 1 incongruent numbers 
1, 2, 1. In view of Fermat’s Theorem, for each 

such number a there exists some minimal exponent h > 1 
such that a h = 1 (mod p). But h must in fact be a divisor 
of p - 1, for if we apply the division algorithm for Z to 
obtain p — 1 = hq + r, we see that a p ~ l = ( a h ) q a r , and so 
a r = 1 (mod p), since a p ~ ] and a h are both = 1 (mod p). 
By the minimality of h, r = 0; thus h \ (p — 1). 

For a fixed value of h dividing p — 1, how many a’ s 
have this property, namely, that h is the least positive 
exponent for which a h = 1 (mod p)1 (Such a’s are said to 
belong to the exponent h modulo p, or to have order h.) If 
there is one such number a, the candidates must be 
restricted to the incongruent numbers a, a 2 , . . . , a h , for 
these are each solutions to x h = 1 (mod p); and since 
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there are h of them, Theorem 26 guarantees that there 
cannot be any others. Even among these, the only possible 
candidates must be of the form a k for k and h relatively 
prime. For if k and h have a common factor q > 2, then 
(a k ) h ^ = (a h ) k / q = 1 (mod p), so that a k would have 
lower order than h. (Each of these candidates actually 
turns out to have order h, as will follow shortly.) 

Thus, for each h, there are at most <p(h) incongruent 
numbers with order h. Letting \p(h) be the precise number 
of incongruent numbers with order h, we now have 

p-\= 2 Hh)< 2 m-p- 1. 

h\p-\ h\p-\ 

where the last equality was proved in Problem 9 of 
Section 2.4. Thus, for each h, it must be that \j/(h) = <j>(h), 
and so in particular, \p(p - 1) = <K/> - 0- ■ 

For example, for p = ll, there should be <j>(10) = 4 
primitive roots. The table in Figure 13 gives the successive 
powers of each integer from 1 to 10, calculated modulo 


Numbers 

1 

2 

3 

Exponents 

4 5 6 7 

8 

9 

10 

1 

1 










2 

2 

4 

8 

5 

10 

9 

7 

3 

6 

1 

3 

3 

9 

5 

4 

1 






4 

4 

5 

9 

3 

1 






5 

5 

3 

4 

9 

1 






6 

6 

3 

7 

9 

10 

5 

8 

4 

2 

1 

7 

7 

5 

2 

3 

10 

4 

6 

9 

8 

1 

8 

8 

9 

6 

4 

10 

3 

2 

5 

7 

1 

9 

9 

4 

3 

5 

1 






10 

10 

1 










Figure 13 
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11, until the number 1 is reached. Note that the four 
numbers 2, 6, 7, and 8 are primitive roots. Note also that 
the order of each number divides <#>(1 1), and that for each 
h dividing <#>(11), the number of numbers with this order is 
<#>(/j). These are illustrations of the facts developed in the 
proof of Theorem 27. Of course, instead of 2, 6, 7, and 8, 
we could have chosen any numbers congruent to them 
modulo 11, such as 13, -5, 95, and —36. 


PROBLEMS 

1. Use congruences to reformulate and prove the 
theorem that every perfect square is of the form 4 n or 
4 n + 1. 

2. If a ^ 0 (mod n) and b ^ 0 (mod n), does it 
follow that ab ^ 0 (mod «)? 

3. If a and n are relatively prime, show that there 
exists an integer b such that ab = 1 (mod n). 

4. With the obvious definition, does there always 
exist a primitive root modulo n, when n is not prime? 

5. Find all the primitive roots modulo 7. 

6. How many primitive roots modulo 17 are there? 
Find one. 


Section 2.7. Constructibility of Regular Polygons II 

In Section 2.5 it was shown that the only values of n for 
which it might be possible to construct the regular n-gon 
are numbers of the form 2 k p x p 2 • • ■ p m , where the pi s are 
distinct Fermat primes. In this section, it will be shown 
that for all such numbers rt, it actually is possible to 
construct the regular n-gon. The proof of this fact is rather 
intricate and may be skipped without loss of continuity. It 
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has been included here because the ingenious argument, 
due to Gauss, is not readily found except in the original 
text, even though it is rather elementary. In Section 3.7 we 
shall be able to construct a very short proof, based on 
much more elaborate theory, and it will be interesting to 
relate that proof to the one given here. The proof of the 
theorem in this section is followed by a detailed example, 
and the reader may find it useful to consult the example 
before, during, and after the study of the proof. 

To begin, the following two lemmas enable the general 
problem to be dealt with in simpler parts. 

Lemma 28a. If m and n are relatively prime, and if both 
the regular m-gon and the regular n-gon are constructible, 
then the regular mn-gon is constructible. 

Proof. By the hypothesis, angles of 2m / m and lir / n 
are constructible. We would like to show that an angle of 
2m /(mn) is constructible. By the Euclidean algorithm for 
Z, there exist integers s and t such that 


1 = sm + tn 



mn n m 


2m 2m , , 2m 

mn n m 

Therefore the angle 2m /(mn) is constructible, since it can 
be obtained as the sum of (possibly negative) multiples of 
constructible angles. (See Problem 3 for an analytic proof 
based on the roots of unity.) i 

Lemma 28b. For k > 2, the regular 2 k -gon is construct- 
ible. For k > 1, if the regular n-gon is constructible, then so 
too is the regular 2 k n-gon. 
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Proof. The first sentence is obvious, requiring only 
repeated bisections of 360°. The second sentence is al- 
most as obvious, requiring only repeated bisections of 
360° /«. i 

And now we are ready for the main result: 

Theorem 28. For n > 3 and of the form 2 k p [ p 2 • • • p m , 
where the p’s are distinct Fermat primes, the regular n-gon 
is constructible . (m may equal 0, in which case n has no odd 
factors.) 

Proof. In view of Lemmas 28a and 28b, it suffices to 
treat the case n = p, where p is a Fermat prime. Hence we 
may write p— 1 = 2 (2 "). We want to show that a primitive 
/7th root of unity is constructible. Denoting such a root by 
w, we recall that the entire set of primitive /7th roots of 
unity is given by w, u 2 , w 3 , . . . , w^ -1 . From the equation 
x p ~ 1 + x p ~ 2 + • • • + x + 1 = 0, which is satisfied by u 
(as well as by the other primitive /7th roots of unity), we 
conclude that the sum of all the primitive /7th roots of 
unity is - 1. 

In the course of our computations we shall frequently 
make use of the fact that oo ' = if and only if i = j (mod 
p), which follows at once from the definition of w. 

Let g be a primitive root modulo /?; we know that such 
primitive roots exist by Theorem 27. Since the numbers 
g, g 2 , g 3 , . . . , g p ~ l are all incongruent, and since none is 
congruent to 0, the primitive /7th roots of unity may also 
be represented by u g , u gl , w* 3 , . . . , w gP '. Because the 
exponents here and in what follows are frequently 
themselves exponentiated quantities, we shall adopt the 
abbreviation [k] for «*. Thus the set S of primitive pih 
roots of unity may be listed as [g], [g 2 ], . . . , [g p ~'\ In 
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referring to the elements of the set S as well as to the 
elements of other sets to be defined below, it will be 
convenient to make reference to the order in which these 
elements are listed. It is important to note that each 
element in this list is the gth power of the one before it, 
and that the first element is the gth power of the last. 
Consequently, we could list the primitive />th roots of 
unity by starting with any element in the list and 
successively raising it to the gth power until we had 
generated the whole list. For example, starting with [g y ] 
and doing this, we get [g'], [g 2+l ], [g ;+2 ], . . . , [g' -1 ], 
[g], . . . , [g^ _, ]> where some of the middle terms have 
been included to clarify what is going on. If we were to 
continue raising these elements to the gth power, the list 
would just start repeating itself. 

Let us denote by S, the set [g], [g 3 ], [g 5 ], . . . , [g^ 2 ], 
where we have taken the first and every other element 
of S. Similarly, denote by S 2 the set [ g 2 ], [ g 4 ], [ g 6 ], 
. ...[g^ -1 ]. Each of these sets may be generated by 
taking successive g 2 powers of any one of its elements. 
Indeed they could even be defined in this way. We could 
define 5, as the set generated by all the g 2 powers of the 
first element of S and S 2 as the set generated by all the g 2 
powers of the second element of S. (As with S, after going 
through sufficiently many g 2 powers, the lists would just 
start to repeat.) It may occasionally be helpful to think of 
and S 2 as having been formed by taking ‘every other’ 
element of S. 

Now, for i = 1, 2, we subdivide 5, into sets 5, , and 5, 2 
by an analogous process. In particular S t , would be 
generated by successive g 4 powers of the first element of 
S t , which has the effect again of taking every other 
element; likewise S, 2 would be generated by successive g 4 
powers of the second element of S,. 



108 


FIELD EXTENSIONS 


Ch. 2 


We should like to continue repeating this process of 
dividing sets in half until each set contains one single 
element. To give a precise inductive definition of the 
process, some more terminology is needed. An m-set is a 
set resulting from m divisions; it can be denoted by m 
subscripts, although in dealing with a particular m-set we 
may agree to suppress the writing of the subscripts. Thus 
5 is a 0-set, 5, and S 2 are 1-sets, and S 1> „ 5, 2 , S 2> j, and 
S 2 2 are 2-sets. Each m-set may be generated by successive 
g*" powers of any of its elements; and each such set 

contains (p - \)/2 m elements. Each m-set, S f ,■ , 

gives rise to two (m + l)-sets, ,• , , and 

i , 2 in the natural way. That is, S, ,• , , is the 

set generated by successive g (2 " +l) powers of the first 

element of S,-, (j im . Similarly, S,- im 2 is the set 

generated by successive g (2 ” +1) powers of the second 
element of S (| ,. The net effect, again, is to pick out 

every other element. The two (m + l)-sets resulting from a 
given m-set are said to be complementary. 

It is important to observe that the fact that /» — 1 is a 
power of 2 is critical to this procedure. If p - 1 were not 
such, then the repeated division into two equal subsets 
would not be possible, nor would the representation of 
sets as successive g 1 " powers of arbitrary elements. In the 
present case, after m = 2 N divisions, each m-set contains a 
single element, namely, some primitive pth root of unity. 

We denote by the period of a set the sum of its 
elements. It is an m-period if it is the period of an m-set. 
Two periods are complementary if they are the periods of 
complementary sets. Thus, each m-period is the sum of 
some (p — \)/2 m primitive /?th roots of unity; and, in 
particular, when m = 2 N , each such period is precisely one 
of the primitive pih roots of unity. Thus, the theorem will 
be proved if we can show that for 0 < m < 2 N , each 
m-period is constructible. We shall show this by induction 
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on m, the essence of the situation being that each 
( m + l)-period is the root of a quadratic equation whose 
coefficients are linear combinations of w-periods. (In fact, 
complementary periods are the two roots of the same 
quadratic equation.) 

To prepare for the argument, recall that if we know the 
sum and product of two numbers: say x x + x 2 = A and 
x x x 2 = B, then a, and x 2 are the roots of the quadratic 
x 2 — Ax + B. (Cf. Section 1.6.) Consequently, if A and B 
are constructible, then so too are x x and x 2 , since they 
may be obtained from A and B via the quadratic formula, 
in which the operations correspond to admissible 
constructions. 

We now enter into the main part of the proof that for 
each m, 0 < m < 2 N , all the m-periods are constructible. 
For m = 0, the only 0-period is the sum of all the primitive 
/?th roots, and it has already been noted that this sum is 
simply - 1, which is, of course, constructible. Let us 
assume that all periods through the ( m - l)-periods are 
constructible. Let tj, be an arbitrary m-period and tj 2 its 
complementary period. By the previous paragraph, it 
suffices to show that tj, + rj 2 and tj,tj 2 are constructible. 
Since rjj + ij 2 is precisely one of the (m — l)-periods, by 
the inductive hypothesis it is constructible. The con- 
structibility of tj,tj 2 will follow rather easily from the 
following claim. 

Claim. For 1 < m < 2 N — 1, tj|Tj 2 may be expressed as 
a linear combination, with non-negative integral 
coefficients, of all the w-periods. In this linear combina- 
tion, complementary periods have equal coefficients. For 
m = 2 N , t),t) 2 = 1. 

Proof. 7j, and rj 2 are the sums of the elements in two 
complementary m-sets S' and S". These two m-sets 
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resulted from the division of some one (m - l)-set, which 
is generated by successive g (2 ” ) powers of some 
particular element u k . Letting h = g (2 " \ this (m - l)-set 
may be written [Ac], [kh], [kh 2 \ . . . , [Ac/i^~ 2 ], where for 
convenience we have defined f = \ + (p - l)/ 2 m ~\ 
Subsequent powers of h just repeat the list, for 
hf- 1 = (£< 2 ’-y- 1)/2 - = gP-' = 1 (mod p). 

Thus we have 

V\=[k]+[kh 2 ]+ ■ ■ ■ + [kh f ~ 3 ], 
t/ 2 = [kh] + [AcAi 3 ] + • • • + [khf~ 2 ]. 

For m = 2 N , h = g (p ~ 1)/2 = - 1 (mod p), so that 17,173 = 
u k u~ k = 1 . For m < 2 *, we write out the product 17,173, 
grouping the terms appropriately (and remembering that 
[kh‘][khj] = [ kh ‘ + W] since [/] denotes «'): 

V1V2- [k + kh] + [kh 2 + kh 3 ] + • • ■ + [kh f ~ 3 + kh f ~ 2 ] 

+ [Ac + fc/i 3 ] + [kh 2 +kh 5 ] + ■ ■ ■ +[kh f ~ 3 + kh] 

+ [k + kh 5 ] + [kh 2 + kh 1 ] + ■ ■ ■ + [khf ~ 3 + kh 3 ] 


+ [k + khf~ A ] + [kh 2 + khf~ 2 ]+ ■ ■ ■ + [khf ~ 3 + kh^~ 6 ] 
+ [Ac + kh!~ 2 ] 4 - [kh 2 + kh] + • ■ ■ + [kh ^~ 3 + khf~ 4 ]. 



Sec. 2.7 


REGULAR POLYGONS II 


111 


Careful scrutiny of this sum will lead to the conclusions in 
the claim. First, each row of the sum is generated by the 
h 2 powers of any of its elements. Second, none of the 
exponents is congruent to 0 modulo p. To see the latter, it 
suffices to treat the first column. Suppose k + kh 2J+ 1 =0 
(mod p), where 1 < 2j + 1 < /- 2. Then we would have 

-l = h 2 J + ' (mod p) 

1 = (mod p) 

1 = ,)(4 -/ +2) (mod p), 

and so, since g is a primitive root modulo p, (p - 1) | 
(2 m ~ 1 )(4y + 2). But 

(p- 1) 

2 < 4/ + 2 < 2/- 4 = ~ 2> 


and so 


(2 m ~ 1 )(4y + 2) < 2(p -l)-2 m 
<2(p-l). 

Consequently, we would have to have 
p- \ = (2 m “ ')(4y + 2) 

= 2 m (2 j+ 1). 

Since p - 1 has no odd factors, the only value of j for 
which this might hold is j = 0; but then we would have 
p - 1 = 2 m < 2 (2 " - = (p — l)/2, which is impossible. 
Consequently each row is some m-period, although not 
necessarily (Remember, there are 2 m m- sets, each 
with a corresponding period.) Furthermore, the first and 
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the last row are complementary m-periods, for the hth 
power of any element of the first row is an element of the 
last row. For example, taking the //th power of the first 
element of the first row results in: 

[k + kh] h =[kh + kh 2 ] (law of exponents) 

= [kh 2 + kh] 

which is the second element of the last row. Since the 
other terms in these rows are generated by successive h 2 
powers of these, it suffices simply to find two terms 
related in this way, as has been done, Similarly, it is easy 
to see that the hth power of the first entry in the second 
row is the third entry in the next-to-last row, and so these 
rows are complementary. By analogous argument, which 
we do not formalize, it is seen that the /th row from the 
top is complementary to the /th row from the bottom. 
Since the number of rows is even, being (/- l)/ 2 , the 
rows can all be paired up in this way, and so each 
m-period occurs as often as its complement. This 
completes the proof of the claim. 

By the claim, t),t} 2 equals a sum of terms of the form 
R(r) + ij), where ■») and fj are complementary m-periods 
and R is a non-negative integer. But since r/ and rj are 
complementary, tj + rj is just some (m — l)-period, and 
hence constructible according to the inductive hypothesis. 
Being a sum of constructible numbers, must then 
itself be constructible. 

Having shown that 17, -I- ij 2 and 17,772 are constructible, 
we are able to conclude (as noted at the outset) that tj, 
and tj 2 are each constructible, which completes the 
proof. 1 
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As an example, let us reconstruct the proof of Theorem 
28 for the case p = 17. If co is any primitive 17th root of 
unity, then all the primitive 17th roots of unity are given 
by co, co 2 , . . . , co 16 . But if g is any primitive root modulo 
17, we can also write this list as co s , u* 2 , . . . , co*' 6 , which 
will simply be a reordering of the previous list. Taking 
g = 3, which is a primitive root modulo 17, the resulting 
list is co 3 , co 9 , co 27 , co 81 , . . . , co, which is made simpler by 
reducing each of their exponents modulo 17. In this way 
we obtain the following complete list: co 3 , co 9 , co 10 , co 13 , 
co 5 , co 15 , co", co 16 , co 14 , co 8 , co 7 , co 4 , co 12 , co 2 , co 6 , co 1 . We call 
this list S. 

In the course of the computations to follow, it is 
possible to adopt two courses. First, we could show that 
all the periods can be obtained as the roots of successive 
quadratic equations, from which the constructibility of the 
periods, and hence of the primitive 17th roots, follows. 
This is all that was necessary and all that was done in the 
proof of Theorem 28. However, this course does not yield 
specific directions on how to construct the polygon 
because it does not tell which root of each quadratic 
corresponds to which period. For example, we may be 
able to find a quadratic satisfied by two periods tj, and rj 2 , 
but further analysis is necessary to determine which root 
corresponds to rjj and which corresponds to tj 2 . 

For the purposes of this example, let us treat also this 
second problem, so that at the end we shall actually have 
a definite algorithm for constructing the regular 17-gon. 
Our analysis of this aspect will be geometrical and 
somewhat intuitive, but it should be clear how the steps 
could easily be translated into verifiable inequalities 
involving cosines. For definiteness, we take to = e 2m/l1 . A 
sketch of the location of the powers of co is given in Figure 
14, where the point co* is labeled by the exponent k. This 
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Figure 14 
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sketch, to which we shall make frequent reference, does 
not represent a construction; but we do know the 
numerical values of the relevant central angles as 
multiples of 2 tt / 17. 

Now we proceed along the lines of the proof of 
Theorem 28. The set 5 has period 17 = - 1, since the sum 
of the primitive />th roots of unity is — 1. S is now divided 
into two sets 5, and S 2 , with corresponding periods: 

77 , = co 3 + co 10 + co 5 + co 11 + co 14 + to 7 + co 12 4- co 6 , 

1)2 = CO 9 + CO 13 + co 15 + co ' 6 + co 8 + co 4 + w 2 + CO*. 

Clearly rj, + rj 2 = rj = - 1. By calculation we find 
= -4, which the reader might care to verify. Thus 17 , and 
i ) 2 are the roots of the quadratic x 2 + x - 4. Now we need 
to see which root is tj, and which is rj 2 . 

Since the summands in each period can be paired off 
into pairs of complex conjugates (co 3 and co 14 , <o 10 and to 7 , 
etc.), the periods may be rewritten as 

7 j, = 2 Re(co 3 4- co 5 4- co 6 4- co 7 ), 

tj 2 = 2 Re(co' 4- co 2 4- co 4 4- co 8 ), 

where Re denotes the real part of the given number. (Thus 
the periods are real.) From the geometrically evident 
inequalities, 

Re(co 2 ) - Re(co 3 ) > Re(co 7 ) - Re(co 8 ), 

Re(co') > Re(co 5 ), 

Re(co 4 ) > Re(co 6 ), 

which could also be established analytically, we see that 
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t ] 2 > tj,. Thus 


Vi = 


-1 — Vl7 

2 


- 1 +VTT 

2 • 

Proceeding to the next stage, we divide S, into S, , and 
2 , and S 2 into S 2 , , and S 2 2 ; the corresponding periods 
are 

7), , = M 3 + Ci) 5 + t*) 14 + CO 12 
7) 12 = W 10 + (O n + W 7 + 0) 6 
7) 2 , = W 9 + W 15 + W 8 + O) 2 
7)22 = W 13 + W 16 + W 4 + <0*. 

We see that tj, , + tj, 2 = tj,, which is known. By a 
calculation, tj, , 7 ), 2 = - 1. Thus, these two periods are the 
roots of x 2 - t),x - 1. By an analysis similar to that in the 
preceding paragraph, we determine which root is which. 
In particular, 

Tj, , = 2 Re(w 3 + w 5 ) 
t), 2 = 2 Re(w 6 + w 7 ), 

from which we see, using the diagram, that 77 , , > tj, 2 - 
Therefore, 

T), + \/r?i + 4 
■'ll, 1 = 5 - 
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Similarly, one finds that 

i ) 2 -yjvil + 4 
Vi, i = j 


1)2 + \/ l ) 2 + 4 

V2,2 2 ' 

It is perhaps worth noting that the products of the 
complementary 2-periods, as obtained in this step, do not 
contain explicitly the 1 -periods, as appears in the proof of 
the theorem. It simply happens to be the case here that 
the 1 -periods have the same coefficients themselves, and 
so the product is given in terms of the 0-period, ij = — 1 . 

The subsequent division of S, „ S, 2 , S 2 „ and S 2>2 
results in the eight 3-periods: 


1)1. 1, 1 = <o 3 + (0 14 

V 2 , 1, 1 

= w 9 + w 8 

Vl, 1,2 = « 5 + W 12 

Vl, 1, 2 

= W ' 5 + W 2 

„ .,10 1 7 

’ll, 2, 1 - W +w 

V 2 , 2 , 1 

= W ' 3 + « 4 

’ll, 2,2 = u" + <0 6 

Vl, 2, 2 

= to 16 + to 1 . 


Now we don’t really have to go to the trouble of finding 
all these; it turns out that it will be enough to find the last 
two, i) 2 2 , i and ij 2 2 , i- Their sum is i) 2> 2 , which is already 
known; and their product is ij, „ which is also already 
known. Thus they are the roots of x 2 — 1 ) 2 2 x + ij, 
From the observation that i) 2 2 , 2 > V 2 , 2 , i> we have 

V 2, 2 ~ V 7 !^, 2 — ^ 1 )i, 1 

1 ? 2 , 2 , 1 — > 
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The last pair of 4-periods are simply « 16 and Their 
sum is tj 2 2 , 2 and their product is 1 . We find then that 


u = 



Having now <o, we have of course essentially all its powers 
as well. 

In summary, the construction of co requires the 
construction of the following sequence of numbers: 


4i = 


_ -1 -t/17 


Vi 


_ -1 +i/T7 


4i, i = 


4 2 ,2 = 


4 i +ytf + 4 
2 

V 2 +V 1 ? 2 + 4 


42, 2, 2 = 


42 , 


2 + V42, 2 ^4l, 1 


42 , 2 , 2 


]/nl 2 ,2- 4 


Thus we have given an actual algorithm for the 
construction of the regular 17-gon. The reader may wish 
to go ahead and carry it out, but the author does not have 
the intestinal fortitude! 
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PROBLEMS 

1. Carry through the proof of Theorem 28 for the 
case p- 5, and deduce from it an algorithm for the 
construction of the regular pentagon. 

2. Generalize the method of this section to show that 
the primitive seventh roots of unity may be obtained by 
the successive solution of polynomials of degrees 2 and 3. 
(You are not asked to solve these lower degree poly- 
nomials.) 

3. If a is a primitive mth root of unity and if b is a 
primitive nth root of unity, and if (m, n) = 1, show that ab 
is a primitive mnih root of unity. Use this fact to give an 
alternate proof of Lemma 28a. 

*4. Find a necessary and sufficient condition on a 
rational number a such that an angle of a degrees is 
constructible. 

*5. Show that all the <p(n) primitive nth roots of unity 
are the roots of a single irreducible polynomial over Q 
which is monic and has integral coefficients. Use this 
result to give an alternate proof of Theorem 24. 


REFERENCES AND NOTES 

Further discussion about basic techniques and concepts used in this 
chapter may be found as follows: divisibility properties of polynomials 
and their relation with those of the integers, [1,2] and textbooks on 
abstract algebra, such as [3, 4]; vector spaces and linear algebra, [5, 6]; 
number theoretic techniques such as congruences and the Euler 
•►-function, [7, 8], The sufficient condition for constructibility of regular 
n-gons is due to Gauss [9], who made a significant step with the 
construction of the regular 17-gon (see [10]). Gauss claimed in [9] to have 
proved that the condition was also necessary, but he did not present it. 
Credit now generally is given to Wantzel [11] for this part, which may 
then also be considered to be essentially the first proof that an arbitrary 
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angle cannot be trisected. For further discussion see [12, 13], the latter 
noting a valiant ten-year project of Hermes to construct a regular 
65537-gon (?). There is a long history of proofs of the irreducibility of 
the nth cyclotomic polynomial. An excellent survey is contained in 
Section 13 of [14]. The proof in our solution to Problem 5, Section 2.7, 
comes from [3]; it is based on a famous eight-line proof by Landau [15]. 
Concise versions phrased in terms of polynomials over finite fields are in 
[16, 17]. 
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CHAPTER 3 


SOLUTION BY RADICALS 


Section 3.1. Statement of the Problem 

As every student of mathematics knows, the solutions to 
the quadratic equatio n ax 2 + b x + c = 0 are given by the 
formula x = (- b ±^b 2 - 4ac )/2 a. In this chapter we 
essentially ask: To what extent can similar formulas be 
found for polynomials of higher degree? 

To begin, the term “similar formula” needs to be given 
precise meaning. For our purposes, we shall consider a 
formula to be similar if it involves only a sequence of 
rational operations and the extraction of roots (of any 
integral order), beginning with the coefficients. Since, 
beginning with any nonzero coefficient, any rational 
number may be computed by a sequence of rational 
operations, certainly any rational constants may be used 
at any point in the procedure, as are 2 and 4 in the 
quadratic formula as given above. 

Even the word “formula” requires clarification, how- 
ever. We shall use it to mean some definite procedure 
which may be applied to a whole class of problems. For 
the quadratic case, the usual formula covers all quadratic 
polynomials. It turns out that there are similar formulas, 
albeit more complicated, for cubic and quartic poly- 
nomials. (See Problems 1 and 4.) But for polynomials of 
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higher degree the situation is entirely different, for it will 
be shown that for each n > 5, there exist polynomials of 
degree n which have roots that cannot be calculated by 
any sequence of rational operations and the extraction of 
roots, beginning with the coefficients! 

For the sake of definiteness, let us take a look at one of 
these polynomials, namely, f(x ) = 2x 5 — 5x 4 + 5. The 
graph of this polynomial for real values of x is sketched in 
Figure 15. There is a local minimum at (2, - 11) and a 
local maximum at (0, 5). It is easy to see that there are 
three real roots and hence two complex, non-real, ones. 
These roots are definite numbers, but, as will be shown 
later, not a single one of them can be computed by any 
sequence of rational operations or the extraction of roots, 
beginning with the coefficients! 

Thus, not only would it be hopeless to ask for such a 
formula that would work for all polynomials of degree 
five, say, but it would even be hopeless to ask for a 
formula that would work for the single polynomial 
2x 5 - 5x 4 + 5. Let us say that a polynomial is solvable by 
radicals if for each of its roots r there exists a sequence of 
rational operations and the extraction of roots through 
which r can be obtained from the coefficients. In light of 
our original question, it is now natural to ask: When is a 
polynomial solvable by radicals? 

This is a very deep question, which will finally be 
answered in Section 3.5. To get started, it is convenient to 
rephrase the definition of solvability by radicals in terms 
of field extensions. If / is a polynomial over a field F, the 
splitting field of f over F, is the smallest field containing F 
and all the roots of /. A field E is a radical extension of a 
field F if E = F (k) for some F such that k " G F for 
some positive integer n. That is, E is the simple algebraic 
extension of F by some nth root of an element of F. We 
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shall say that a polynomial / E F[x] is solvable by radicals 
over F if and only if there is a sequence of fields 
F = F 0 c F, c F 2 C • • • CF„ such that for each j, F- + , is 
a radical extension of F-, and such that the splitting field 
E, of /, over F, satisfies E c F^. In both the definition of 
splitting field and the definition of solvability by radicals, 
when explicit mention of the field F is omitted, we shall 
understand F to be the smallest field containing the 
coefficients of /; this field is called the coefficient field of f. 
With this convention, it should be clear that this 
field-theoretic definition of solvability by radicals is 
simply a reformulation of the definition given in the 
previous paragraph. 

Thus, the problem of solvability by radicals can be 
stated in terms of a problem about field extensions. When 
is the splitting field of a polynomial contained in some 
field F^ which can be obtained from the coefficient field 
by a sequence of radical extensions? 


PROBLEMS 

1. Show that the general cubic equation ax 3 + bx 2 + 
cx + d = 0 can be solved by radicals. (Hint: Without loss 
of generality, take a= 1. Make the successive substitu- 
tions x = y + L andy = z + K/ z for appropriate choices 
of L and K.) 

2. What is the degree of the splitting field of x 3 - 2 
as an extension of Q? Find a basis for this extension. 

3. Find an irreducible polynomial over Q whose 
splitting field has degree less than n\ over Q, where n is 
the degree of the polynomial. (Cf. Problem 8 of Section 
2 . 2 .) 

*4. Show that the general quartic equation ax 4 + 
bx 3 + cx 2 -I- dx + e — 0 may be solved by radicals. (Hint: 
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Without loss of generality, take a = 1 . Let y = x + (b /4), 
and then try to write the resulting equation as a difference 
of squares, making use of some extra parameter.) 

5. Show that there exists a number which has degree 4 
over Q but which is not constructible. (Hint: Consider the 
roots of f(x) = 4x 4 — 4x + 3.) 


Section 3.2. Automorphisms and Groups 

As noted at the conclusion of the previous section, we are 
faced with a problem about fields. When is the splitting 
field of a polynomial contained in some field which can be 
obtained from the coefficient field by a finite sequence of 
radical extensions? Fields are fairly complicated 
mathematical objects, containing in all cases under 
consideration here an infinite number of elements, which 
can be combined by means of two arithmetic operations 
(addition and multiplication) and their corresponding 
‘inverse’ operations (subtraction and division). The power 
of the theory evolving from the work of Galois derives 
from the way it enables us to rephrase our problem about 
fields into a problem about much simpler mathematical 
objects, called groups, which have only a single arithmetic 
operation and, in the present case, only a finite number of 
elements. 

The well-known mathematician Emil Artin (1898-1962) 
once wrote: “In modern mathematics, the investigation of 
the symmetries of a given mathematical structure has 
always yielded the most powerful results.” We shall see 
this theme borne out in the developments of this chapter. 
To begin, let us formulate an interpretation of the word 
“symmetries”. Most reasonable people would probably 
agree that the square is more ‘symmetrical’ than the 
general rectangle, both of which are sketched in Figure 16. 
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If asked to support this point of view, one might respond 
that there are eight distinct rigid motions of the square 
that will map it onto itself (reflections in each of the four 
dotted lines as well as rotations of 0°, 90°, 180°, and 270° 
about its center in, say, the counterclockwise direction), 
whereas for the rectangle there are only four (reflections in 
each of the two dotted lines as well as rotations of 0° and 
180° about its center). Thus it is quite natural to measure 
the symmetry of an object by looking at the different 
ways in which it can be mapped onto itself so as to 
preserve its structure, and this is precisely what we shall 
do with field extensions. 

If F is a field and if <j> is a function mapping F to F, we 
say that <f> is an automorphism of F if it is one-to-one, onto, 
and it ‘preserves the field operations’. By this latter phrase 
we mean that for any numbers a, b E F, <j>(a + b) 
= 4>(a) + <j>(b), 4>(a - b) = <p(a) - cf>(b), <p(ab) = <j>(a)<p(b), 
and <f>(a/b) = <f>(a)/^(b). It is rather simple to see 
(Problem 1) that any automorphism must leave each 
rational number unchanged. Furthermore, if a function <j> 
from F to F is one-to-one and onto and preserves addition 
and multiplication, then in fact it must preserve also 
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subtraction and division, and thus be an automorphism. 
We shall make frequent use of these facts. 

Clearly the identity mapping, which maps each element 
of F to itself, is an automorphism. As another example, 
the mapping of C given by complex conjugation, 
<j>(a + bi) = a - bi, a and b real, is an automorphism of C. 
To verify this, since <j> is obviously one-to-one and onto, it 
suffices to show that <f> preserves addition and multiplica- 
tion. We have 

<j>[(a + bi) + (c + di)] = <p[(a + c) + (b + d)i] 

= (a + c) — (b + d)i 


— (a - bi) + (c - di) 


= <j>(a + bi) + <|>(c + di), 

+ bi)(c + di)] = <>[(ac - bd) + (be + ad)i] 
= (ac - bd) - (be + ad)i 


= (a - bi)(c - di) 


= <p(a + bi)<p(c + di), 

which completes the verification. More examples will 
appear as we proceed. 

Since automorphisms are functions from F to F, it 
makes sense to form the composition <f> o \p of two such 
automorphisms. For brevity we usually write this new 
function as we have <t>\p(a) = <j>(\p(a)) for every a E F. 
The function <j>\p is frequently called the product of <j> and 
\p. For functions in general, the order in which they are 
composed makes quite a difference, and this is the case 
for automorphisms as well. In the exceptional circums- 
tance when we say that <j> and ip commute. At the 



130 


SOLUTION BY RADICALS 


Ch. 3 


same time, the composition of functions is always 
associative, so we always have t = (p(fr), where <f>, \p 
and r are automorphisms, and thus we can write the 
product unambiguously. Since an automorphism <p is 
one-to-one and onto, it has a unique inverse function, 
which we abbreviate <p~ and which satisfies <fxp~' = 
<p~'<p = identity mapping on F. 

Lemma 29a. If <p and are automorphisms of a field F, 
then so too are <p ~ 1 and <jnp. 

Proof, <f> -1 is obviously one-to-one and onto. We want 
to show that it preserves addition and multiplication. We 
have 

+ ^)] = a + b = </>[</>“ '(a)] + <|>[<J) _l (fe)] 

= <j>[4>-'(a) + <J>~'(6)] 
so by the fact that <p is one-to-one, 

<p \a + b) = <p '(a) + (j> ~ fb). 


Similarly, 

<>[<*“ ! («^)] = ab = 4 >[* -, («)] ’ 

and again by the fact that <j> is one-to-one, it follows that 
<t>~'(ab) = <p~ ] (a)- <p~'(b). 

With respect to (pip, to show that it is one-to-one, suppose 
that <p\p(a ) = <p^(b). Since <p is one-to-one, we must have 
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\p(a) = \p(b); since f is one-to-one, we must have a = b. 
This shows that <f np is one-to-one. That <fnp is onto should 
be clear. The preservation of addition and multiplication 
by <t>ip follows as below: 

H( a + b) = <J>[i//(a) + H b )] = <l>\p(a) + <j>ip(b) 
<j>ip(ab) = <t>[\p(a)\p(b)] = tj>\p(a)- <j>\p(b). ■ 

A set G of automorphisms of F is called a group of 
automorphisms if it has the following three properties: (i) 
G contains the identity automorphism, (ii) G is closed 
under composition (i.e., <f>, \p G G=></np 6 G), and (iii) G 
contains the inverse of every one of its elements. For our 
purposes, the word “group” will always refer to a group of 
automorphisms of a field. By the previous lemma and the 
remarks preceding it, the set of all automorphisms of a 
field F is a group; but there may be subsets of this set 
which are groups as well. For the field C, let id denote the 
identity mapping and <j> the mapping given by complex 
conjugation. Then {id, </>} is a group, but {<{>} is not. 

If (j> is an automorphism, a useful abbreviation is </>". 
When n is a positive integer <f> n stands for <p composed 
with itself n times; when n is a negative integer <p n stands 
for <f> -1 composed with itself |n| times; <j>° stands for the 
identity mapping. One reason why this abbreviation is so 
useful is that it obeys the usual laws of exponents: 
<f> m+n = < p m < f > n and (4>”) k = 4> nk , which follow directly from 
the definition. 

We shall only need to consider finite groups, that is, 
groups with a finite number of elements. In this case, we 
denote by | G \ the number of elements in G; it is called the 
order of G. Given a set of automorphisms, to decide 
whether it forms a group we might try to verify items (i), 
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(ii), and (iii) in the definition of a group. But if the set is 
finite, then not all these conditions need to be checked, 
for we have: 

Lemma 29b. If G is a finite set of automorphisms and if 
G is closed under composition , then G is a group. 

Proof. Given such a set G, let 4> be one of its elements 
and let n be the total number of elements. Since G is 
closed under composition, the list <j>, <#> 2 , <f> 3 , . . . , <J>' I+I 
must be a subset of G and therefore must contain some 
element twice. Suppose <j> J = <jd +k for some k > 1. Since 
<|> y+ * = <( d<f> k , we have — <jd<t> k . By the fact that <jP is 
one-to-one, it follows that <f> k is the identity auto- 
morphism, id. To show that G contains the inverse of <f>, 
we observe that if k = 1, <j> is its own inverse, whereas if 
k > 1, <t > k ~ 1 is the inverse of 4>. ■ 

If H and G are groups of automorphisms of a field F 
and if H c G, then we say that H is a subgroup of G. For 
a very simple example, H - {id} is a subgroup of any 
group. What kinds of subsets of a group might be 
subgroups? An important step in answering this question 
for finite groups is the following necessary condition, 
called Lagrange’s Theorem: 

Theorem 29. If G is a finite group and if H is a 
subgroup of G, then \H\ is a divisor of \G\. 

Proof. Let us define m = \H\. Then we can represent 
the (distinct) elements of H by the list: if/,, \j/ 2 , ... , \p m . If 
this list includes all of G, we are done, for then H = G 
and |H| = |G|. If there is some element <J>, of G not in this 
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list, we write down a second list below the first: 

'Pi 'Pi *p3 ■ ■ ■ 'Pm 

•Pl'Pl •Pi'Pi •Pi'Pj ■ ■ ■ <Pi 'Pm- 

The elements in the second list are products of elements 
of G and hence are elements of G. They are all distinct, 
since if then <j>f ’ t±> , ^ = tj>f and so 

ft = \pj. Also the two lists have no elements in common, 
since if <>,»//, = iL, then </>, = > pj\p~' £ H, by closure, which 
contradicts the choice of Now, if these two lists 
exhaust G, we are done, for then \G\ = 2\H\. But if there 
exists some element 4> 2 of G not in the list, we write a third 


list below the others: 



'Pi 

'Pi 

'Pi 

'Pm 

<f> i'Pi 

1 Pi'Pi 

•Pi'Pj 

Mm 

Mi 

• Mi 

M 3 • • 

■ ■ Mm- 


By the previous argument the elements of the third row 
are distinct among themselves, and they are distinct from 
the elements of the first row. They are also distinct from 
the elements of the previous row, for if c Mi = 'Pi'Pj, then 
4> 2 = ^I'Pj'Pr \ which belongs to the second row, since 
1 is some element of H. But this would contradict the 
choice of <f> 2 . Continuing to adjoin new elements to the list 
in this way, m at a time, we must eventually exhaust G, 
since G is finite. If at that point we have k rows, then 
mk = |G|. Thus \H\ divides |G|. I 

The set of elements in each row of the table used in the 
proof of Theorem 29 is called a left coset of H. The 
number of such cosets is given by \G\/\H\, which is called 
the index of H in G. 
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If G is a finite group and <}> e G, it was shown in the 
proof of Lemma 29b that for some integer k, <j> k = id. 
Letting k be the least such integer, look at the set of 
automorphisms {<£, <j> 2 , . . . ,<f> k = id}. It is a group! For, 
= <f> r , where r is the unique number between 1 
and k, inclusive, such that / + j = r(mod k). This group is 
a subgroup of G, and we call it the cyclic subgroup of G 
generated by </>. If there is an element <J> E G such that the 
cyclic subgroup generated by 4> is all of G, then we say 
that G is a cyclic group. Cyclic groups of automorphisms 
will be particularly important in our work. 

It was noted earlier that automorphisms do not always 
commute. However, in certain special cases they do. A 
group G is said to be Abelian or commutative if every pair 
of its elements commute. We have the following simple 
result: 

Theorem 30. If a group G is cyclic, then it is Abelian. 

Proof. By the definition of cyclic, G may be 
represented as {<£, <f> 2 , . . . ,<j>" = id}, where n = |G|. By 
associativity we have <j>'<y = = 4> J <f>', so that any two 

elements commute. a 

This section began with the assertion that in studying 
mathematical objects it often proves useful to study their 
symmetries, where a symmetry is a one-to-one mapping of 
the object onto itself which preserves the relevant 
structures. In the case of fields, we study groups of their 
automorphisms. Much can be learned about the structure 
of the fields in question by studying the structure of 
certain automorphism groups. By the ‘structure’ of groups, 
we mean things like how many and what kind of 
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subgroups they have, whether they are cyclic or not, and 
similar questions. 

It is worth remarking at this point that all the properties 
of automorphism groups developed in this section could 
have been defined or derived for any mathematical system 
having the same basic properties as such groups. This 
more general point of view may be found in textbooks on 
abstract algebra. 


PROBLEMS 

1. Suppose is a mapping from a field F to itself 
such that <|> is one-to-one, onto, and preserves addition 
and multiplication (i.e., <f>(a + b) = <j>(a) + <f>(b) and <j>(ab) 
= 4>(a)<j){b) for all a, b £ F). Show that 4>(0) = 0, <f>(l) = 1, 
and that <j> is an automorphism of F. 

2. Let <f> be an automorphism of a field of F. Show 
that for every rational number q, <j>(q) = q. (That is, </>, 
restricted to the subfield Q of F, is the identity mapping.) 

3. Find all the automorphisms of QC^2 ). 

4. Find all the automorphisms of Q(V2 ). 

5. Prove Theorem 29 using right cosets. (If H is a 
subgroup of G, a right coset of H is a set {^) where the 
\p t 's range through H and </> is a fixed element of G.) 

6. Let G be any group, finite or not, and let H be a 

subgroup of G. If <f> G G, then the set <(>H = \ $ E H } 

is called the left coset of H by <j>. Show that any two left 
cosets of G are either identical or disjoint. 

7. If G is a group whose order is a prime p, how 
many subgroups does G have? 

8. If G is a cyclic group of order n, determine exactly 
how many elements of G generate G. 
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9. If G is a cyclic group of order n, and if m | n, show 
that G has a cyclic subgroup of order m. 

10. Show that every subgroup of a cyclic group is 
cyclic. 

11. If |G| = p, a prime, show that G is cyclic. 

12. If H is a subgroup of prime index p in G , show 
that other than for H and G, there is no subgroup H such 
that H cH cG. 


Section 33. The Group of an Extension 

In this section we shall introduce those groups of 
automorphisms that will be of use in our work. If E is an 
extension of the field F, consider the set of all 
automorphisms of E which leave each element of F 
unchanged. This set is readily seen to be a group. First, it 
obviously contains the identity automorphism on E. 
Second, it contains the inverse of each member; for 
<p(a) = a if and only if a = <f>~\a) and so the 
automorphisms <j> and <f> ~ 1 leave the same elements 
unchanged. Third, it is closed under composition; if the 
automorphisms $ and \p leave every a e F unchanged, 
then so too does the automorphism \p<l>, for ft/fa) 
= i//(<f>(a)) = i p(a) = a. This group of automorphisms is 
called the group of E over F, and it is denoted G’(E/F). 

How many such automorphisms might there be? An 
infinite number? A finite number? What do they look 
like? Well, recalling the results of Problems 3 and 4 
of the previous section, there is only one element of 
G(Q( \^2 )/Q) and there are two elements of 
G(Q(V^)/Q). In order to answer these questions in 
general, we first need some terminology. If a number r is 
algebraic over F, then r is the root of some irreducible 
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polynomial over F, and this polynomial is unique up to a 
constant factor (Section 2.2). We denote the complete set 
of roots of this polynomial r = r,, r 2 , . . . , r n , where n is its 
degree. These roots are called conjugates of each other. 
They are distinct, since an irreducible polynomial cannot 
have multiple roots (Problem 10, Section 2.1). The simple 
algebraic extension F(r) may or may not contain any 
conjugates of r (other than r itself). The following theorem 
and its proof yield considerable information about the 
structure of the elements of G(E/F) when E is written as a 
simple extension of F. 

Theorem 31. If E = F(r), where r is algebraic over F 
with conjugates r = r x , r 2 , . . . , r n , then for each <j> £ 
G(E/F), <t>(r ) = some r ( ; moreover , for each r, £ E, there is 
a unique <f> £ G(E/F) satisfying tf>(r) = r,. If E is any finite 
extension of F, then |G(E/F)j < [E : F]. 

Proof. The second sentence will follow immediately 
from the first, since by Theorem 18, every finite extension 
is a simple algebraic extension, E = F(r), and the total 
number n of conjugates of r satisfies n = deg F r = [E : F]; 
so certainly there are at most n conjugates in E. 

Let <J> £ G(E/F). We shall make use of the fact that if 
p(x) = c 0 + c,x + c 2 x 2 + • • • + c k x k is any polynomial 
with coefficients in F, then 4>(p(rj) = p{<?{r)). To see this, 
observe that <t>(p(r)) = <j>(c 0 ) + $(c,)[<p(r)] + <Mc 2 )[<K'')] 2 + 
• • • + <H c fc)[ < H r )]* by the fact that <J> is an automorphism, 
and then from this that <t>(p(r)) = c 0 + c,[</>(r)] + 
c 2 [ < K r )] 2 + • • • + cJcK'O]' 1 = p(<j>(r)) by the fact that <j> 
leaves the elements of F unchanged. 

Now, if / is the minimal polynomial of r over F and if 
<t> £ G(E/F), then 0 = <>(0) = </>(/(r)) = f(<t>(r)) and so <j>(r) 
must be some conjugate r t of r. Conversely, if r, £ E we 
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want to show that there is a unique <p E G( E/F) such that 
<t>(r) = r ; . This is more complicated. 

By Theorem 17, any element a EE can be expressed in 
the form a = p(r) for some polynomial p E F[x]. (If we 
restrict the degree of p to be < n — 1, then this 
representation is unique, but this restriction is not 
convenient for our present purposes.) From the equation 
<p(a ) = 4>(p(r)) = p(4>(r)) it follows that an element 
<j> E G( E/F) is uniquely determined by its value <j>(r). 
Thus there is at most one <j> E G( E/F) such that <j>(r) = r t . 

In accordance with this computation, let us try to 
construct such a <j> by the natural rule. That is, for each 
a E E, write a = p(r) for an appropriate p E F[x] and then 
define <^a) = p(r t ). To see that this mapping is well 
defined, suppose a has two representations: a = p(r) 
= p(r), where both p and p belong to F[x]. We want to 
verify that p(r) = p(r : ). This follows from the fact that the 
polynomial p - p, which has rasa root, must therefore be 
divisible by the minimal polynomial f of r over F 
(Problem 6, Section 2.1). Since r, is also a root of f,p~p 
has r ( for a root. That is, p(r t ) = p{r). 

This mapping <j> leaves elements of F unchanged, for 
such elements may be represented by constant poly- 
nomials. That <p is onto follows from the fact (Problem 13, 
Section 2.2) that E = F(r,). In particular, any element of E 
can be represented in the form p(r,) for some p E F[.x], 
and p(r t ) is the image under <j> of p(r), which is in E. To 
see that <j> is one-to-one, suppose that <p(q(r)) = <j>(q(r)), 
where q and q are polynomials in F[x]. By the definition 
of </>, q(r t ) = q(r : ) and so, by reasoning exactly analogous 
to that in the previous paragraph, q(r) = q(r). Thus <j> is 
one-to-one. It remains to show that <j> is an automorphism, 
and by Problem 1 of Section 3.2, for this it suffices to 
show that it preserves addition and multiplication. 
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Let a and b be arbitrary elements of E, represented 
respectively, as p(r) and q(r), where p and q G F[x]. 
Letting t G F[x] be the product of the polynomials p and 
q, we have: 

4 tab) = <#»(/(/•)) = t(r,) = p(r,)q(r ,) = 4 >(a)<t>(b ). 
Letting s G F[x] be the sum of p and q, 

4>(a + b) = <(>(s(r )) = s(r f ) = p(r,) + q(r ,) = <j>(a) + <t>(b). 

Thus the mapping induced by the condition <f>(r) = r, is an 
automorphism. This completes the proof. 1 

Returning now to the examples preceding Theorem 3 1 , 
G(Q(vT)/Q) and G(Q(yT)/Q), of respective orders one 
and two, we note that in the first case the single 
automorphism not only leaves each element of Q 
unchanged, it leaves other things unchanged as well, 
namely all of Q(\f2). The second case is different, 
however, in that the set of elements left unchanged by 
every (i.e., both) element of the group is precisely Q itself. 
This is an important distinction. 

If <p is an automorphism of a field E, then the set 
K t ={flSE| <j>(a ) = a} is readily seen to be a field itself. 
For example, to verify closure under addition, let 
a, b G K^. Then <£>(a + b) = <j>(a) + <j>(b ) = a + b. Con- 
sequently a + b G K^. The other operations are handled in 
the same way. By Problem 2 of Section 3.2, contains 
all the rational numbers. The field is called the fixed 
field of <j>. If G is any set of automorphisms of E, then 
the intersection of their corresponding fixed fields, K = 
i% eG K^ ; is called the fixed field of G. It is, of course, a 
field since the intersection of an arbitrary collection of 
fields is always a field. We have just had one example 
where the fixed field of G(E/F) is E and another example 
where the fixed field of G(E/F) is F. Obviously, the fixed 
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field K of G( E/F) is always an intermediate field between 
E and F; that is, E d K d F. 

In the light of the preceding developments, there are 
certain natural questions one might ask. Under what 
conditions are the above relations a little ‘tighter’? For 
example, when do we have |G(E/F)| = [E : F]? When is 
the fixed field of G(E/F) simply F itself? Not only might 
these questions appear inherently interesting, but in fact 
they are very close to the core of the present investigation. 
Definitive answers will be given in the next section, but 
the interested reader may prefer to think about them 
independently first. 

If / is a polynomial over F and if E is its splitting field 
over F, then G(E/F) is called the Galois group of / over F. 
If the field F is not specified, it is understood to be the 
smallest field containing all the coefficients of /. The 
Galois group of a polynomial is the group whose structure 
will ultimately be of interest and for whose study we need 
to develop the more general theory of the groups of field 
extensions, as in this and the following sections. 
Determining the Galois group of a polynomial can be 
quite difficult, but fortunately we shall not have need of a 
general procedure for this. We have looked at one very 
simple case already, of course, namely f(x ) = x 2 -2. For 
the splitting field is Q(V2~ ) and G(Q(V^~ )/Q) has been 
shown to have two elements, the identity mapping and the 
mapping <p : a + b\[2 -> a - b^jl , a, b G R. Other ex- 
amples will appear in the problems and elsewhere. 
(Problem 1 will be very important for later work.) 

PROBLEMS 

1. If p is a prime, show that the Galois group of 
x p ~ 1 + x p ~ 2 + • - • + 1 = ( x p - l)/(x - 1) is cyclic and 
of order p — 1 . 



Sec. 3.4 


TWO FUNDAMENTAL THEOREMS 


141 


2. Determine the order of G( E/Q) where E = 
Q(\/2\ V3~ ). (Hint: See Problem 7, Section 2.3, and 
Problems 1 and 2, Section 1.4.) 

3. What is the order of the Galois group of x 5 — 1? 

4. Is there some polynomial / G Q[x] such that 
Q(^2 ) is its splitting field? 

5. If <(> G <j(E/F), where E is a finite extension of F, 
and if K is an intermediate field, show that the set 
<KK) = (<Ka) | a G K} is also an intermediate field. Show 
further that [K : F] = [<f>(K) : F]. 

6. Let F be a field. Numbers a { , a 2 , ■ ■ ■ , a„ are said 
to be algebraically independent over F if there is no 
nontrivial polynomial p(x t , x 2 , . . . , x n ) over F such that 
p(a t , a 2 , . . . , a n ) = 0. Show that if F is countable, then for 
any n there exist n algebraically independent elements 
over F. 

7. Let a v a 2 , . . . , a n be algebraically independent 
over a field F. Define E = F(a,, a 2 , . . . , a n ). Show that for 
every permutation of the numbers a u a 2 , . . . , a„ there 
exists a unique $ G G(E/F) which has the effect of this 
permutation on the a/s. 

8. How many automorphisms of R are there? 

*9. If the field of complex numbers C is an algebraic 
extension of a field E, show that any automorphism of E 
can be extended to an automorphism of C. 

*10. How many automorphisms of C are there? 

Section 3.4. Two Fundamental Theorems 

If E is a finite extension of F with the property that every 
irreducible polynomial over F that has one root in E has 
all its roots in E, then E is said to be a normal extension of 
F. At first, this may be a difficult property to appreciate; 
it is perhaps best clarified by moving directly to the 
following major theorem: 
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Theorem 32. Let E be a finite extension of F. Then the 
following statements are equivalent : 

(i) E is a normal extension of F. 

(ii) E is the splitting field of a polynomial over F. 

(Hi) | G(E/F)| = [E : F]. 

(iv) F is the fixed field of G(E/F). 

(v) E is a normal extension of every intermediate field 
K, E D K D F. 

Proof. In order to show the equivalence of the above 
statements, the seven implications represented by arrows 
in the following diagram will be established: 



The implication (i=>ii) is not logically necessary to the 
proof, but as it is very brief and helps to clarify the 
relation between the statements, it will be included. 

(i=*ii) Since E is a finite extension of F, by Theorem 
18, E is a simple algebraic extension, E = F(r). If / is the 
minimal polynomial for r over F, E is the splitting field of 
/. For, if r = r x , r 2 , . . . , r n are the conjugates of r, 
E = F(r) c F(r,, r 2 , . . . , r n ) c E, the last inclusion follow- 
ing from normality. 

(ii => v) Let E be the splitting field of a polynomial / 
over F with roots r,, r 2 , . . . , r n . Then E = F(r,, r 2 , . . . , r n ) 
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and we may build up to E by a sequence of simple 
extensions: F, = F(r,), F 2 = F,(r 2 ), F 3 = F 2 (r 3 ), . . . , F„ 
= F„_,(r n ). A basis for the extension F- over F y _ , was 
shown in the proof of Theorem 17 to be 1, r jy rj, 
. . . , \ where m = deg F which may be anything 

from 1 to n. As in the proof of Theorem 19, a basis for the 
extension E over F may be obtained by forming all 
products such that one factor is chosen from the basis of 
each of these single simple extensions. We conclude that 
each element a E E may be written in the form 
a = P(r v r 2 , , r„), where P is a polynomial over F in n 
variables. 

Now we apply these facts to show that if g is an 
irreducible polynomial over a field K, where F c K c E, 
and if g has one root a E E, then all its roots must be 
in E. By the previous paragraph, we may write 
a = P(r x , r 2 , . . ■ , r n ), where P is a polynomial over F in n 
variables. Now look at all the values of P under all 
permutations of r. s; that is, if tt i is one of the n\ permu- 
tations on the numbers 1 through n, define a, = 
P(r„ x i), r„ j(2) , . . . , r„ M ). Taking tt, as the identity permu- 
tation, the resulting values are a = a v a 2 , . . . , a nl , each of 
which is in E. We will show that g divides the polynomial 
h defined by h(x) =I1"L i (a - a,), and hence that all its 
roots are included among those of h , all of which are in E. 

To show that g divides h, we first show that h has all of 
its coefficients in F. Letting a, denote the /th elemen- 
tary symmetric function of n\ variables and letting 

p„( x I, *2- • • • . X n) = p ( x n,oy x *,( 2 y_ • • • . ^ l( „))> we have 
that the coefficient of x n] ' in h is simply 
± a i( p «y Ky • • • > K n ) evaluated at (r„ r 2 , . . . , r„). How- 
ever the polynomial function of n variables given by 
Oj(P Tl , p „ 2 , ■ • • , p „J is a symmetric polynomial over F, 
for any permutation of the variables simply permutes the 
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P' s, under which o, is invariant. Therefore, by Corollary 
9a the value of , . . . , P n J when evaluated at the 

roots (r,, r 2 , . . . , r n ) of a polynomial over F, is in F. Since 
we now have h E F[x], certainly h E K[x]. Now both g 
and h E K[x], they have one root a in common, and g is 
irreducible. It follows (Cf. Problem 6, Section 2.1) that g 
divides h in K[x]. As noted at the outset, this implies that 
all of the roots of g, being among those of h , are in E. 
Thus E is a normal extension of K. 

(v =* i) This is obvious, as F itself is an intermediate 
field. 

(i =s> iii) Since E is a finite extension of F, we know we 
can write E = F(r), where r has conjugates r x , r 2 , . . . , r n . 
Also, we must have n = [E : F]. By normality, each r, £ E. 
By Theorem 31, corresponding to each r t there is exactly 
one element of G(E/F), and there are no other elements 
of G(E/F). Thus | G(E/F)| = [E : F]. 

(iii => ii) As in the previous paragraph, r must have the 
same number of conjugates in E as there are auto- 
morphisms in G(E/F). So if | G(E/F)| — [E : F] = n, all n 
conjugates of r must be in E. Thus E is the splitting field 
for the minimal polynomial of r over F. 

(iii=>iv) Let K be the fixed field of G(E/F). Clearly 
E d K d F. By (v), which has been shown to be equivalent 
to (iii), E is a normal extension of K. Since i => iii, 
|G(E/K)| = [E : K], But G(E/K) = G(E/F), since every 
automorphism of E which leaves F fixed also leaves K 
fixed. Thus we have: [E : K] = |G(E/K)| = | G(E/F)| 
= [E : F], By Theorem 19, [E : F] = [E : K][K : FJ; it 
follows that [K : F] = 1 ; that is, K = F. 

(iv => iii) Again we begin by writing E = F(r). Let 
n = [E : F] = deg F r, so that any polynomial over F of 
which r is a root must have degree > n. Letting 
m = |G(E/F)|, we will construct a polynomial of degree m 
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over F having r as a root, from which we will thus 
conclude m > n. By Theorem 31, then, m = n. 

Let </>,, <j> 2 , . . . , <f> m be the elements of G( E/F). We will 
show that the polynomial h(x) = n"Li(* - <j>,(r)) has all 
its coefficients in F by showing that each coefficient is 
unchanged by every <j> E G(E/F). The coefficient of x m ~‘ 
is ±o i (<j> l (r), <f> 2 (r), . . . , <f> m (r)), where o, is the ith 
elementary symmetric function in m variables, a 
symmetric polynomial over Q and hence over F. Letting 
<J> be an arbitrary automorphism in G(E/F), 
<*>k(4>i(0> 4>i(r), .... <|> m (r))] = o,(H,(r), <j>4> 2 (r), . . • , 
<M> m (r)) = 4 > 2 (' - ). • • • > )) since o, is symmetric 

and the list <fxj > 2 , . . . , representing all of 
G(E/F), must be a permutation of the list <p t , <f> 2 , , <f> m . 
Since <p is arbitrary, each coefficient in h, being un- 
changed by every element of G(E/F), must be in F. But r 
is a root of this polynomial, since the identity 
automorphism must be in G(E/F), and so m > n. As 
noted earlier, this suffices to complete the proof. ■ 

As a corollary, we obtain a natural one-to-one corre- 
spondence between intermediate fields and subgroups of 
G(E/F), called the fundamental Galois pairing: 

Corollary 32. If E is a normal extension of F, there is 
a one-to-one correspondence between the intermediate fields 
K, E D K D F, and the subgroups of G(E/F); this 
correspondence associates K with G( E/K). The correspon- 
dence reverses inclusion relations: K, c K 2 if and only if 
G(E/K,)DG(E/K 2 ). 

Proof. With each intermediate field K we associate the 
group G(E/K); K is the fixed field of this group since by 
Theorem 32 E is normal over K. Every subgroup of G(E/F) 
is associated with some such field, namely its fixed field. In 
particular, if H is a subgroup of G(E/F) with fixed field K, 
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it is easy to verify that H must be all of G(E/K) by 
repeating, essentially verbatim, the (iv => iii) part of the 
proof of Theorem 32, replacing G( E/K) by H and F by K 
throughout. This shows that |//| = [E:K]. At the same 
time, we know from Theorem 32 that |G(E/K)| = [E:K], 
and by the definition of G(E/K) that H c G(E/K). Hence 
H = G(E/K). If G( E/KO = G(E/K 2 ), then the two groups 
have the same fixed field. By normality of E over both K x 
and K 2 , K 1 = K 2 . That inclusion relations are reversed 
follows directly from the definitions of fixed field and 
group of an extension. I 

In the absence of the assumption that E is a normal 
extension of F, the above correspondence would no longer 
be one-to-one, for several fields might have the same group 
associated with them. For example, if E = Q(v^2 ), then 
both E and Q have the same group associated with them 
(only the identity automorphism on E). 

From Theorem 32 and its corollary, we can begin to get 
some feeling for the properties of normal extensions and 
for the correspondence between fields and groups. How- 
ever, there is still an obvious open question: If E is a 
normal extension of F, under what circumstances is an 
intermediate field K also a normal extension of F? While 
we know that E must be a normal extension of K, certainly 
K need not be a normal extension of F. For example, if E 
is the^splitting field of x 3 - 2 over Q, one intermediate field 
is Q(/2 ), and we have already seen (Problem 4, Section 
3.3) that this is not a splitting field of any polynomial over 
Q, and hence by Theorem 32 it cannot be a normal 
extension of Q. An alternate way to conclude that Q(v^) 
is not normal over Q is to observe that |G(Q(v^)/Q)| = 
1^3 = [Q(v^):Q]. 

Intermediate fields K, EdKdF, where K is normal 
over F are very important in the problem of solvability by 
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radicals. It will be seen below that such fields correspond 
to a special kind of subgroup of G(E/F). 

If H is a subgroup of a group G, and if $ G G, let us 
define the set </> - l H<j> = {<t>~ | <^ E # }• It is a subgroup 

of G. For, since id E H, (j> _1 id<j> = = id E<j> _ 'H<p\ 

since \p~' E H, (<j>~ G <p~'H4>; and if 

ip, r E H, (<j> ~ _ V^)) = <p ~ l (\f/T)<l> G <f>~ Also, 

since G is closed, c G. The subgroup <p~'H<f> is 

said to be a conjugate subgroup of H. Two subgroups //, 
and H 2 of G are said to be conjugate if //, =<p~'H 2 <t> for 
some <f> E G. The relation of conjugacy is easily seen to be 
an equivalence relation on the set of subgroups of G; it is 
reflexive, symmetric, and transitive. (See Problem 3.) A 
subgroup H of G is said to be a normal subgroup, or an 
invariant subgroup, if <p ~ 'H<p = H for every <j>G G; that is, 
if H equals each of its conjugates. 

Two extensions K, and K 2 of a field F are said to be 
conjugate over F if K, = F(r,) and K 2 = F(r 2 ), where r, 
and r 2 are conjugates over F, that is, r, and r 2 have the 
same minimal polynomial over F. Under the fundamental 
Galois pairing, conjugate fields will be shown to 
correspond to conjugate subgroups, and this fact, called 
the Fundamental Theorem of Galois Theory, is at the 
core of the solution to the problem of solvability by 
radicals. It is also of intrinsic interest because of the 
structural questions it clarifies. But in order to deal 
conveniently with conjugate fields, we first need the 
following preliminary result: 

Lemma 33. Let E be a normal extension of F. 
Intermediate fields K, and K 2 are conjugate if and only if 
there exists some <f> G G(E/F) such that <J>(K,) = K 2 . 

Proof, (if) Assume that there is such a <p G G(E/F) 
and write K, as a simple algebraic extension of F, 
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K, = F(r ,). Then <£(/-,) must be a conjugate of /•„ for if / is 
the minimal polynomial of r, over F, 0 = <Kf ( r \)) = 
f( < t>( r i))- Let r 2 = <j>(r,) E K 2 . Clearly F(r 2 ) c K 2 . To show 
equality, we have [K 2 : F] = [K, : F] = deg F r, = deg F r 2 
= [F(r 2 ) : F], where the first equality follows from 
Problem 5 of Section 3.3. But since it is also true that 
[K 2 : F] = [K 2 : F(r 2 )][F(r 2 ) : F], we conclude that 
[K 2 : F(r 2 )] = 1. That is, F(r 2 ) = K. Hence K, and K 2 are 
conjugate fields. 

(only if) If K, = F(r,) and K 2 = F(r 2 ), where r, and r 2 
are conjugates of each other, we want to show that there 
exists a. 4>G G( E/F) such that <KK,) = K 2 . Suppose 
[K, : F] = «, so there are n conjugates /•„ r 2 , . . . , r n . It is 
actually easier to establish the stronger result that for each 
rj there are exactly [E : K,] elements <j> of G(E/F) such 
that <;>(/•,) = rj. Let r be fixed and such that there is at 
least one ip £ G(E/F) for which 4 / ( r i) = f )• (Initially, we 
do not know whether any such rj s exist.) Let 
Sj = (<f>G G( E/F) | <f>(r,) = r^) and observe that the set 
\p~'Sj = | G Sj} is just G(E/K,). For, each 

element of i p~'S leaves K, unchanged, and any 
< j> E G(E/K[) can be written <p = ^ -1 (^c|>), in which it is 
clear that \p<() E Sj. Thus is the left coset i|/G(E/K,). If m 
is the number of conjugates of r ] for which there are such 
nonempty sets Sj, each of which is now known to have 
exactly |G(E/K,)j elements, we have (since every <#> is in 
some Sj 

|G(E/F)| = m|G(E/K,)|. 

By the normality of E over F and over K„ 

[E : F] = m[E : K, ], 

from which it follows that m — [K, : F] = «. 
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By the result of the previous paragraph, <£(K,) D F(r 2 ); 
but since every element of <#>(K,) is a polynomial over F 
evaluated at r 2 , in fact </>(K,) = F(r 2 ) = K 2 . ■ 

And now for the Fundamental Theorem of Galois 
Theory: 

Theorem 33. Let E be a normal extension of F. Two 
intermediate fields K, and K 2 are conjugate over F if and 
only if G(E/K,) and G( E/K 2 ) are conjugate subgroups of 
G(E/F). K is a normal extension of F if and only if 
G( E/K) is a normal subgroup of G( E/F). 

Proof. If K, and K 2 are conjugate over F, then by the 
preceding lemma there exists an automorphism <j> 
E G( E/F) such that <KK,) = K 2 . Using the abbreviation 
H = G(E/K 2 ), it will follow easily that G(E/K,) 
— For if \p E G(E/K,), then f 

and for every a G K 2 , <t>f<p ~ fa) = ~ ’(«)] '( a ) 

= a, since ip leaves ^ -l (a)eK, fixed. Therefore 
~ 1 e G(E/K 2 ). Conversely, if ip G H = G(E/K 2 ), then 
for every aeK,, <j>~ , ip<f)(a) = [<>(«)] = <l> ~'<j>(a) = a, 

so that <j>~'ip<j>G G(E/K,). Thus G(E/K,) and G(E/K 2 ) 
are conjugate subgroups of G(E/F). 

On the other hand, if G(E/K,) = where 

H = G(E/K 2 ), we shall conclude that £(K,) = K 2 . For if 
a G K„ but <Ma)& K 2 , then <jfa) would not belong to the 
fixed field of H. Thus there would exist some f G H such 
that ^<Ka) ¥* <p(a). But then <j>~ l \p(p(a) j= a, contrary to 
the hypothesis. Thus <KK,) C K 2 . By a similar argument, 
<#> - *(K 2 ) c k,, and so K 2 c <>(K.,). Consequently, ^(K,) 
= K 2 

With respect to normality, recall that by the funda- 
mental Galois pairing, there is a one-to-one correspon- 
dence between the intermediate fields and the subgroups 
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of G(E/F). As shown above, conjugate fields correspond 
to conjugate subgroups. Now K is normal over F if and 
only if K has no conjugate field K # K. For if K is 
normal, it contains all the conjugates of any of its 
elements, by the definition of normality; and conversely, 
if K does contain all the conjugates of every one of its 
elements, it is the splitting field of a minimal polynomial 
of any r such that K = F(r). Thus, K is normal over F if 
and only if G(E/K) has no conjugate subgroup other than 
itself; that is K is normal over F exactly when G(E/K) is a 
normal subgroup of G(E/F). ■ 


PROBLEMS 

1. If F, = F(r,) and F 2 = F(r 2 ) and if F, and F 2 are 
conjugate fields over F, does it follow that r, and r 2 are 
conjugates over F? 

2. If E is a finite extension of F, show that there are 
only a finite number of intermediate fields K, E D K D F. 
(This already appeared as Problem 12 of Section 2.2, but 
an alternate and much simpler proof is now accessible.) 

3. Show that conjugacy is an equivalence relation on 
the set of subgroups of a group. (Given a set 5 and a 
relation holding between certain pairs of elements of 
5, this relation is called an equivalence relation if it is 
reflexive [for every a £ S, a ~ a], symmetric [if a ~ b, 
then also b ~ a] and transitive [if a ~ b and b ~ c, then 
a~~ c]. 

4. Show that conjugacy is an equivalence relation on 
the set of finite extensions of a field F. 

5. If E is a normal extension of F and if K is an 
intermediate field, can every automorphism in G(K/F) be 
extended to an automorphism of E? 
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6. Let E be an extension of F, and let <p be an 
automorphism of F. Can <> necessarily be extended to an 
automorphism of E: (a) in general? (b) if E is a finite 
extension of F? (c) if E is a normal extension of F? 

7. Let /•) and r 2 be conjugates over a field F and 
suppose that r 2 ^F(r,). Prove or disprove: F(r,) n F(r 2 ) 
= F. 


Section 3.5. Galois’ Theorem 

At the beginning of this chapter, we undertook to study 
the question of when a polynomial is solvable by radicals. 
Let us review the major developments so far. First, the 
question was formulated in terms of fields. Letting / be a 
polynomial in x and letting F be the smallest field 
containing its coefficients, then the question is this: When 
is there a sequence of fields F = F 0 c F, c • • • C F^, 
each a radical extension of the previous one, such that the 
splitting field E of / satisfies E c F^? Subsequent sections 
dealt with structural questions in the theory of field 
extensions and especially with their corresponding notions 
in the theory of groups. In particular, for any fields E and 
F such that E is a normal extension of F, a specific 
one-to-one correspondence was set up between in- 
termediate fields K, E D K D F, and subgroups of G(E/F). 
Under this correspondence, called the fundamental Galois 
pairing, those intermediate fields K which are themselves 
normal extensions of F were seen to correspond to 
subgroups G(E/K) c G(E/F) which are actually normal 
subgroups of G(E/F), and conversely. It has been 
asserted that this correspondence will enable us to treat 
problems about fields by studying the corresponding 
questions about groups, which are simpler. (One place 
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where this has already been done is in Problem 2 of 
Section 3.4.) In this section we shall fully utilize this 
approach in order to answer the original question 
concerning solvability by radicals. 

If G is a finite group, then G is said to be solvable if 
there is a sequence of subgroups G = H 0 d //, d • • • 
D H n = (id) such that each is a normal subgroup of 
prime index in the previous one; that is, if for each j, 
0 < j < N - 1, H J+] is a normal subgroup of Hj and 
\Hj\/\H j+i \ is a prime number. Such a sequence of 
subgroups is called a composition series for G with prime 
factors, the prime factors being the numbers \H\/\H /+ ,|. 
(Incidentally, if H, + l is normal in // , this does not imply 
that it is normal in H i for any i , 0 < i < j — 1 . See 
Problem 9.) In this section we shall prove the famous 
Theorem of Galois: 

Theorem 34. A polynomial is solvable by radicals if 
and only if its Galois group is solvable. 

Although the deepest results needed in the proof of this 
theorem have already been established in the previous 
section, there is still a fair amount of work to be done. 
Consequently, the proof presented here will be given in 
the form of a sequence of lemmas which should serve as 
an outline of the main ideas. The reader is advised not to 
become immersed in the details of the individual proofs 
before trying to understand the general form of the devel- 
opment. 

Why should the solvability of the Galois group of a 
polynomial / have anything to do with the solvability of / 
by radicals? The best way to answer this seems to be to 
start on the “if” part of the theorem, treated in Lemmas 
34a through 34d. Throughout, we shall denote by F the 
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smallest field containing the coefficients of / and by E the 
splitting field of /. 

Lemma 34a. If f has a solvable Galois group, then there 
is a sequence of fields F = F 0 c F, c • • • CF A , = E such 
that for each j, 0 < j < N — 1, F - + , is a normal extension 
of F '■ of prime degree. 

Proof. By the solvability of the Galois group 
G = G(E/F), there is a sequence of subgroups G = H 0 
D //,□••• D H n = {id}, each a normal subgroup of 
prime index in the previous one. By the fundamental 
Galois pairing, Corollary 32, their corresponding fixed 
fields may be written F = F 0 c F, c • • • c F^ = E, 
where F f is the fixed field of Hj. Being a splitting field of a 
polynomial over F, E is a normal extension of F, and so it 
is also a normal extension of each intermediate field F y 
(Theorem 32); thus Hj = G(E/F / ). Since H j+X is a normal 
subgroup of Hj, Theorem 33 implies that Fy +1 is a normal 
extension of F . The degree of this extension is 
[F y+1 : Fj] = [E : F y ]/[E : F /+1 ] = |G(E/F y )|/|G(E/F y+1 )| 
= \ H:\f\H /+1 |, which is the index of H /+1 in // and hence 
a prime. a 

Lemma 34b. If K is a normal extension of prime degree 
p over a field K which contains the pth roots of unity, then 
there is a collection of p — 1 numbers a { , a 2 , . , a p _ x such 

that for each i, af e K, and such that K c K(a,, a 2 , 

■ ■ • > v,). 

Proof. The lemma simply asserts that every element in 
K, a normal extension of prime degree over K, can be 
obtained from elements of K using only rational 
operations and the extraction of /?th roots. 
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Since K is normal over K, we can write K = K(r), where 
all the conjugates of r also belong to K. We shall want to 
number these conjugates in a certain way. First observe 
that G( K/K) is of prime order p by normality; and it is 
cyclic, since by Lagrange’s Theorem, the cyclic subgroup 
generated by any element <£ ^ id must be the entire 
group. Thus we may write G(K/K) = {<f>, tp 2 , . . . , <j> p }. As 
noted many times before, $ must map conjugates of r to 
conjugates of r, because if g is the minimal polynomial of 
r over K, then 0 = 4>(g(r )) = g (<£(/•)). Define r i = for 
1 < i < p\ then these r- s must all be distinct because the 
<j> n s are all distinct and the action of </>' on r uniquely 
determines its action on all of K = K(r). Since [K : K] = p, 
r has exactly p conjugates, and so r,, r 2 , . . . , r are the 
entire set of roots of g. Letting u be a primitive pih. root of 
unity, define 

cij - + u J r 2 + u 2 J r 3 + ■ ■ ■ + u ( r- x)j r p 

for 0 < j < p - 1. Writing these out for the different 
values of j, we obtain p linear equations in r, through r p : 

a o~ r \ + r 2 + r 3 + • • ■ + r p 

a, = r, + wr 2 + w 2 r 3 + • ■ • + 

a 2 = r, + u 2 r 2 + w 4 r 3 + • • • + o> (p ~ x) 2 r p 

a p _ l = r, + u (p -' ) r 2 + u 2 (p ~^r 3 + • • ■ + w (p ~ r p 

Considering the r- s as the ‘unknowns’, we will show that 
this is a nonsingular system of equations and that for each 
j, {ajf E K. Once the former fact has been established, it 
will follow from the elementary theory of linear equations 
that the r- s may be obtained from the numbers a p and to 
via rational operations. 
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First observe that a 0 6K since it is an (elementary) 
symmetric polynomial evaluated at the roots of a 
polynomial g over K. For each j, 1 < j < p — 1, we show 
that ( ajY G K by showing that (ay is unchanged by every 
automorphism of G(K/K), from which it follows that it is 
in the fixed field of G(K/K), which is K. First we 
calculate <J>[(a ; y’]: 

<f> [(«/] = 

= [<j>(r,) + coAf>(r 2 ) + u 2j <t>(r 3 ) + • • • + 

= [r 2 + co'r 3 + co 2 -V 4 + • • • + w 0’->Vr 1 ] / ’ 

=«• 

Consequently, we see that for all /, §'[(ay\ = (c^y. Thus, 
(a/GK. 

To show that the system of equations is nonsingular, 
observe that the determinant of the coefficient matrix is a 
Vandermonde determinant: 


1 

1 

i 2 

l 3 

l' -1 

1 

CO 

co 2 

CO 3 

co/’" 1 

1 

co 2 

(CO 2 ) 2 

(CO 2 ) 3 . 

.. (co 2 f-' 

1 

<y-' 

(«'“ ’) 2 

K" 1 ) 3 ■ 

1 

1 


= II (co'-co'), 

o < i <j < p - 1 

which is nonzero since co' ^ co ' for 0 < i < j < p — 1 . 
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Since the system is nonsingular, the values of the r ’ s can 
be calculated from the aj s and co by means of Cramer’s 
Rule, for example, and hence by rational operations. 
Thus, for each i, r, G K(a„ a 2 , ■ ■ ■ , a p -\)\ and so 
K = K(r) c K(a„ a 2 , . . . , a p _ ,). I 

In order to apply Lemma 34b, we need to modify the 
sequence of fields in Lemma 34a so that they contain the 
relevant roots of unity. The key to this step is the 
following: 

Lemma 34c. If K is a normal extension of prime degree 
p_ over a field K and if to is a primitive pth root of unity, then 
K(to) is a normal extension of degree p over K(w). 

Proof. Consider the following diagram: 



K(w) K 



in which a line indicates that the upper field is an 
extension of the lower one, and where the letters m and p 
denote the degrees of those extensions. Since p divides 
[K(«) : K][K : K] = [K(w) : K] = [K(w) : K(w)][K(w) : K], 
and since [K(w) : K] = deg K w < deggw = p — 1, it follows 
that p must divide [K(co) : K(w)]. Hence [K(to) : K(w)] > p. 
However [K(co) : K] = deg^w < deg K co = m, so that p 
> [K(w) : K(u)]._Thus [K(w) : K(w)] = p. 

__ To see that K(co) is a normal extension of K(w), write 
K = K(r) and suppose that g is the minimal polynomial of 
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r over K. Then K(to) is the splitting field of g(x)(x p - 1) 
over K, and hence it is normal over K and K(w) as well. 
(Alternatively, one could say that K(to) is the splitting field 
of g over K(a>).) | 

By putting together the preceding lemmas, we are able 
to complete the proof of the first half of the theorem: 

Lemma 34d. If G(E/F) is solvable, then f is solvable by 
radicals. 

Proof. Consider an arbitrary pair F y and F\ + , of 
successive fields in the sequence guaranteed by Lemma 
34a. If [F y+ , : F ] = p and if u is a primitive p th root of 
unity, then by Lemmas 34c and 34b, F /+1 cF^w, a,, a 2 , 
...,a i), where for each », afeFV(u). Using an ad- 

ditional subscript to keep track of which F } we are working 
with, we could rewrite this as ¥j +1 cF j (u J< aj <l , a J 2 , 
. . . , fly ). Now, if we start with F and adjoin 
successively the numbers: u 0 > a o , i> a o , 2 > • • • > a o. Po ’ u i> 

fli, |> fli. 2 - • • • . a \,p,’ •••.««-! a n- I.fl,.,. we obtain a 

sequence of radical extensions such that the final one 
contains ¥ N and hence E. Thus / is solvable by radicals. 


To prove the “only if” part of the theorem, we 
successively revise the sequence of radical extensions 
guaranteed by the definition of solvability by radicals 
until we obtain a sequence of normal extensions of prime 
degree such that the last one exactly equals E, the splitting 
field. To begin, we have: 

Lemma 34e. If f is solvable by radicals, there is a 
sequence of fields F = F 0 cF,c-- - cF^ such that 
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E c F w and such that for each j, F- + , = F 'j(af) where 
ap £ F j for some prime pj. 

Proof. In the original sequence guaranteed by the 
definition of solvability, take an arbitrary pair of 
successive fields f ,cf, +i = ¥j(a), where a" £ F y . Let n 
be written as a product of primes, n = p x p 2 • ■ ■ p k - By 
adjoining to Fy the elements a P2P:> />t , a p * p * ' Pk , ... , a Pk , 
a in succession, we obtain F- + , through a sequence of 
radical extensions of the required type. If we insert such 
fields between every pair in the original sequence, we 
obtain a new sequence of the required type. (For 
convenience, let us agree to represent the new sequence by 
the same notation as the original.) ■ 

Next we should like to modify this sequence of fields to 
a sequence of normal extensions of prime degree, but first 
there are some preliminaries. 

Lemma 34f. If « is a primitive pth root of unity, where p 
is a prime, and if K is any field, then G(K(w)/K) is cyclic. 

Proof. If <j> £ G(K(u)/K), </>(w) must be a conjugate of 
u and hence must equal some power of w. Moreover, <f> is 
uniquely determined by its value at co. For each 
([> £ G(K(«)/K), let <#> be the restriction of <p to the domain 
Q(to). Then is obviously an automorphism of Q(co) and 
hence an element of G(Q(u)/Q). Moreover, {<j> | <> £ 
G(K(«)/K)} is a subgroup of the cyclic group G(Q(w) 
/Q), and hence it is itself cyclic. (See Problem 1, Section 
3.3 and Problem 10, Section 3.2.) Let ip be a generator of 
this subgroup and let f be the corresponding element of 
G(K(w)/K), \p being uniquely determined since it is 
determined by the value of xp(cc). Then xp is a 
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generator of G(K(w)/K), since <j> = (\p) k implies that 
<t> = (\p) k , again because they are uniquely determined by 
their values at to. i 

Lemma 34f will be used in conjunction with Lemma 
34g below in order to adjoin to F, by a sequence of 
normal extensions of prime degree, as many roots of unity 
as we shall want. 

Lemma 34g. Every cyclic group is solvable. 

Proof. If G is a cyclic group of order n, we may write 
G ={<!>, <p 2 , ..., <t>" = id}, where 4> is a generator of G. If 
we write n as a product of primes, n = p x p 2 ■ ■ ■ p k , then 
the cyclic subgroups of G generated by <j>, <j> Pl , <t> p,p \ 
. . . , <j> p,P2 " Pk , respectively, each have prime index in 
the previous one, for their orders are respectively 

P 1 P 2 P 1 ' ‘ ' Pk’PiPi Pk> Pi' ■■ Pk 1- Moreover, 

since G is Abelian (Theorem 30), each of its subgroups, 
and in particular these, must be normal, because 
<j> ~ '//<£> = (<|> tyfp | ip E //} = {</> ~ = 

£//} = //, for all <f> E G. ■ 

The last auxiliary result needed is the following: 

Lemma 34h. If K contains the pth roots of unity, where 
p is a prime, and if a p E K but a ^K, then K(a) is a normal 
extension of degree p over K. 

Proof. If w is a primitive pih root of unity, then the roots 
of x p - a p are a, w a, w 2 a, . . . , w p ~'a. Since these are all 
in K(a), K(a) is the splitting field over K of x p - a p ; and 
so it is a normal extension of K. Since o^K, |G(K(a)/K)| 
= [K(a) : K] > 2, and so there exists an automorphism 
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<f> E G(K(a)/K ) such that <p(a) ± a. But is a root of 
x p - a p , and so <f>(a)/a is a p\h root of unity; not being 
equal to one, this must be a primitive pih. root of unity, 
say to. It follows that <f> ; (a) = <fd~ '(ton) = axfd^fa) = • • • 
= to-to since <j>(w) = to because u£K. Therefore the 
automorphisms </>, <j> 2 , <f> 3 , . . . , <j> p are all distinct since 
<f> y (a) = u j a ^ to'a = <f>'(a) for 1 < i < j < p. Conse- 
quently |G(K(n)/K)| > p. At the same time |G(K(a)/K)| 
= [K(a) : K] = deg K a < p, so that |G(K(a)/K)| = p. 
Therefore, [K(a) : K] = p. ■ 

Now we are ready to move into the mainstream of the 
proof of the second half of Galois’ Theorem. 

Lemma 34i. If f is solvable by radicals, then there exists 
a sequence of fields F = E 0 c E, c • • • cE„ such that 
EcE m and such that for each j, E /+ , is a normal extension 
of prime degree over E y . 

Proof. Returning to the sequence of fields given by 
Lemma 34e, let us make a list of all the a’s and p’s 
occurring in that sequence: p 0 , a 0 , p x , a x , p 2 , a 2 , . . . , 
p N _ j, a N _ { . Let to, denote a primitive p,th root of unity. 
By starting at F and adjoining successively to 0 , a 0 , to,, 
a,, <o 2 , a 2 , . . . , <o^_i, a N _ v we shall be able to obtain the 
desired sequence of field extensions. This is really very 
simple. Each time we adjoin an to,, the group of the 
extension is cyclic (Lemma 34f) and hence solvable 
(Lemma 34g). Since the extension is actually the splitting 
field of the pfh cyclotomic polynomial, by Lemma 34a it 
can be obtained by a sequence of normal extensions of 
prime degree. Each time we wish to adjoin an a,, since to, 
must already have been adjoined, either the extension is 
of degree one (i.e., a, already belongs to the previous 
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field), or else Lemma 34h guarantees that the extension is 
a single normal extension of prime degree p,. 

To formalize these ideas, we proceed by induction on 
N, the number of extensions given by Lemma 34e. If 
N = 0, we are already done; no extensions are necessary. 
As the inductive hypothesis, we now assume that for 
N = R it is possible to modify the sequence F 0 c F, 
C • • • C Ffl into the desired type of sequence E 0 c E, 
C • • • CE W such that F R c E M . For N = R + 1, we 
simply want to adjoin u R and a R to E M . If u R £ E M , we 
go on to a R . If u r #E m , by Lemma 34f, G{E M {u R )/E M ) is 
cyclic; by Lemma 34g it is solvable. Since E^w*) is the 
splitting field over E M of the p R ih cyclotomic polynomial, 
(x p * - l)/(x - 1) = x p * ~ 1 + x p *~ 2 + • • • + x + 1, 
Lemma 34a guarantees that there is a sequence 
E m C E m+1 c • • • C Ej = E^Oofl), in which each field is 
a normal extension of prime degree over the previous one. 
Next we adjoin a R . If a R £ E y , we are already done. If 
a R ^E Jy by Lemma 34h, E y + , = Ej{a R ) is a normal 
extension of prime degree p R over E y . Thus the sequence 
Eq c E, c • ■ • c E w has been extended to include u R 
and a R , as desired. ■ 

Finally we wish to modify the sequence of fields given 
in Lemma 34i so that the last one actually equals E, the 
splitting field of /. 

Lemma 34j. If f is solvable by_ radicals , then there exists 
a sequence of fields F = E 0 c E, c • • • C E y such that 
E = E y and such that for each j, E - + , is a normal extension 
of prime degree over Ey. 

Proof. Referring to the fields given in Lemma 34i, 
define K ; = E y H E. Thus we have F = K 0 c K, c • • • 
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CK„ = E, and it remains to show that once repetitions 
have been omitted from this sequence, it will be exactly 
what we want. Consider the following diagram: 



where a line indicates that the upper field is an extension 
of the lower one. As shown in the diagram, we can write 
K y + 1 as a simple extension of K y by some element r. We 
want to show that either Ky +1 = K ., or else that K. +1 is a 
normal extension of prime degree. 

It is certainly the case that r e E since r e K y+1 c E. If 
also r e Ey, then r e K y = E y HE, and so K >+1 = Ky. If the 
only other possibility holds, namely, r € E y , then E y+1 = 
E y (r); for E y g E y (r) c E y + 1 . Since [E y (r) : E y ] divides the 
prime p = [ E >+1 : E y ],[E y (r ) : E y ] = p. By normality of E JJf x 
over E y , there are exactly p automorphisms in G(Ey +1 /Ey), 
and each maps r to a distinct conjugate of r over Ey. Call 
these conjugates r = r x ,r 2 ,...,r p . Conjugates of r over E y 
are necessarily conjugates of r over K y , since the minimal 
polynomial for r over E y must divide the minimal poly- 
nomial for r over K ., the latter polynomial considered as a 
(not necessarily irreducible) polynomial over E y with r as a 
root (Problem 6, Section 2.1). These conjugates are all both 
in E y+1 and E (since E is normal over the intermediate 
field K y . between F and E), and so they are in K y+1 = E - +1 
HE. (Note: we do not know whether there may be other 
conjugates of r over Ky, also in K y + 1 .) 

For each of the p automorphisms <£, e G(E y+1 /E y ), 
define to be its restriction to the domain K - +1 . We want 
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to show that the </>, are all automorphisms of K . +1 , that 
they leave K 7 fixed, that they are distinct, and that they 
form a subgroup of G(K- +1 /K-). (Ultimately, we will see 
that they constitute all of G(K >+1 /K y ).) Recall from the 
proof of Theorem 31 that each <}> i is uniquely determined 
by which conjugate r, it maps r to, and further that the 
actual image of any element is obtained by replacing r by 
r, in any representation of an element as a polynomial over 
the base field, evaluated at r. Since each r- e K y+ i. it 
follows that <t>,(Kj) c K . +1 , for each i. From the proof of 
Theorem 31, it is an automorphism. For all K ; is left 
fixed, since it is a subset of E , all of which is left fixed by 
<t>,. The *,.*s are distinct because the r ’ s are distinct, and 
they form a subgroup because they are closed under com- 
position and are finite (Lemma 29b). Let us call this 
subgroup H. Recall that \H\ = p. 

Let K H denote the fixed field of H, and let K c denote 
the fixed field of G(K J+1 /Kj). We wish to show that 
K y c K f; c K„ c K from which it will follow that there 
must be equality throughout. Clearly K y c K c , since each 
element of G(K 7+1 /K 7 ) leaves all of K y fixed, by defini- 
tion of the group. Also, K fl cK ff because anything left 
fixed by all of G(K 7+1 /K 7 ) is certainly left fixed by 
automorphisms in the subset H. To show K w c K y , take 
an arbitrary element s e K w . It is left fixed by all the </>,’s, 
hence by all the <J>,’s, and it is in E j, since it is in the fixed 
field of G(E ;+1 /E y ), which equals E ; by normality of E J+1 
over E-. But also jeE since jeK j+ ,cE. We conclude 
that igKj = E 7 n E, and thus that K ff cK j. As noted 
above, this implies that K, = K c , or that K • is the fixed 
field of G(K y+1 /K y ). By Theorem 32, K 7 + 1 is a normal 
extension of K 7 and [K y+1 : K] = |G(K 7+1 /K 7 )|. By 
Corollary 32, G(K 7+1 /K ; ) = H, since they have the same 
fixed field, and thus [K y+1 : K] = \H\ = p. 
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We have shown then that there are only two 
possibilities at each step: either K^ +1 = K ■ or Ky + , is a 
normal extension of prime degree over K ■. By listing the 
K’s, but excluding the repetitions, we obtain the sequence 
of fields sought. ■ 

From here the final step in the proof of Theorem 34 is 
almost immediate. 

Lemma 34k. If f is solvable by radicals, then its Galois 
group is solvable. 

Proof. Corresponding to the fields given in Lemma 
34j, the fundamental Galois pairing applied to the normal 
extension E over F produces the normal subgroups 
G(E/F), <7 (E/E,), G(E/E 2 ), . . . , G(E/E y ) = (id). The 
index of G(E/E. + 1 ) in G(E/E,) is |G(E/E,)|/|G(E/E, + I )| 
-[E:E,]/[E:E, + 1 J-[E, + , : E y ], which is prime. Thus 
G(E/F), the Galois group of /, must be solvable! ■ 

Now that we have a necessary and sufficient condition 
for solvability by radicals, it is natural to ask what can be 
done with it. After all, up to this point we have not even 
calculated the Galois groups of any but the most trivial 
polynomials; and if you can’t calculate the Galois group 
of a polynomial, you could hardly expect to be able to tell 
whether it’s solvable or not. The calculation of the Galois 
group of x 4 - 2 is developed in the problems. In the next 
section, we shall explicitly calculate the Galois group for a 
certain class of polynomials, and we shall show that these 
particular groups are not solvable. From this, of course, 
will follow the fact that there exist polynomials which are 
not solvable by radicals. In the subsequent section, we 
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shall treat several polynomials whose groups are solvable; 
and by reconstructing the ideas in the proof of Theorem 
34, we shall actually show how to solve them by radicals. 

PROBLEMS 

1. If E is a normal extension of F and if K, and K 2 
are intermediate fields, describe G(E/K, H K 2 ). 

2. In the context of Problem 1, show that if K, and 
K 2 are actually normal extensions of F, then G(E/K, D 
K 2 ) = {# | <t> e G(E/K,), i|/ e G(E/K 2 )}. 

3. If E is a normal extension of F with intermediate 
fields K, and K 2 , and if K is the smallest field containing 
K, and K 2 , what is G(E/K)? 

4. Let o) be a primitive fifth root of unity. Find a 
field F such that [F(«) : F] = 2. 

5. Let oi be a primitive 31st root of unity. Show that 
there exists a field F such that [F(u) : F] = 6. 

6. Find the order of the Galois group of x 4 - 2; and 
then describe completely, in some explicit way, each of its 
elements. 

7. Find all the subgroups of the group obtained in 
the previous problem, and determine which ones are 
normal subgroups. 

8. If E is a normal extension of K and if K is a 
normal extension of F, does it follow that E is a normal 
extension of F? 

9. If H is a normal subgroup of a group G and if N is 
a normal subgroup of H, does it follow that N is a normal 
subgroup of G? 

10. Why should it be obvious that the Galois group 
calculated in Problem 6 is solvable? Verify its solvability 
directly. 
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11. If F is a field containing no plh roots of the 
number A e F, for a fixed prime p, show that the poly- 
nomial x p - A is irreducible over F. 


Section 3.6. Abel’s Theorem 

In this section we shall prove that there exist polynomials 
which are not solvable by radicals. This will be 
accomplished by explicitly calculating the Galois groups 
of a whole class of polynomials and then by showing that 
the resulting groups are not solvable. As was noted at the 
end of the previous section, we have not spent much effort 
up to now on the question of how to find, or even how to 
describe effectively, the Galois group of a polynomial. For 
certain very simple cases such groups were described in 
the text; in Problem 1 of Section 3.3 the Galois group of 
the /»th cyclotomic polynomial was calculated; and in 
Problem 6 of Section 3.5 the Galois group of x A - 2 was 
calculated. But in all these cases considerable knowledge 
about the roots of the polynomials was used in describing 
their groups. In this section we shall see an example of 
how it is possible to describe and calculate Galois groups 
without much prior knowledge of the roots of the relevant 
polynomials. 

Let us assume that / is a polynomial of degree n over a 
field F and that the roots of / are labelled in an arbitrary 
but fixed way as r,, r 2 , . . . , r n . Let us further assume that 
these roots are all distinct. (This will be true, for example, 
in the special case when / is irreducible over F.) Letting E 
be the splitting field over F of /, we have E 
= F(r,, r 2 , . . . , r„). Recall that the Galois group G 
— G( E/F) is the group of all automorphisms of E which 
leave each element of F unchanged. Because of the fact 
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that if <f> E G,f(r) = 0 implies that <#>[/(/•)] = /[4>( r )l = 
each automorphism of G maps the set of roots of / onto 
itself; that is, each 4> £ G simply induces a permutation on 
the numbers r„ r 2 , . . . , r n . Moreover, the action of <f> on 
these roots completely determines its action on all of E, 
for any element of E can be expressed as a polynomial 
combination of these roots. Therefore, any <f> E G can be 
described completely by describing the permutation which 
it induces on the r/s. 

There happens to be a convenient standard way to 
represent permutations. To begin, suppose <j> e G and let 
Tj be one of the roots of /. We can start to make a list: 
rj, <j>(rj), <p 2 (rj) .... Eventually some must repeat 

something earlier in the list, since there are only n possible 
values for the numbers in the list to assume. If k 
corresponds to the exponent of <j> in the first such 
repetition, then </>*(r y ) = Tj itself; for if <f>*(r y ) = <f >'(')) with 
i > 1, then <^ k ~\r } )- ^ would correspond to an earlier 
repetition. The list of numbers r y , <J>(V ; ), </> 2 (r,), . . . , 
<t> k ~\ r j) is called a cycle, for each entry is obtained by 
applying <j> to the one before it, and the first is gotten by 
applying </> to the last. Now, if r m is another root of / not 
in the cycle just constructed, we can construct in the same 
way a cycle containing r m ; and obviously the set of roots 
occurring in this cycle will be disjoint from the set of 
those occurring in the previous one. Continuing in this 
fashion, the action of <f> on the roots of / can be described 
by a list of disjoint cycles. In writing these cycles, let us 
agree to represent the root rj simply by writing the number 
j; further we shall enclose cycles in parentheses and 
juxtapose disjoint cycles. 

For example, if / is an irreducible polynomial of degree 
5 with (distinct) roots r,, r 2 , r v r 4 , r 5 , suppose that there 
is some <j> E G, such that <p(r t )= r 4 , <f>(r 2 ) = r v 
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<p(r 3 ) = r } , cf>(r 4 ) = r 2 , and <p(r 5 ) = r 3 . This information 
completely specifies <p. An abbreviation for <j> would be 
(1 4 2)(3 5); for, reading from the left, it says that r, is 
mapped to r 4 , r 4 to r 2 , r 2 to r,; also r 3 is mapped to r 5 and 
r 5 to r 3 . There is some leeway in this system of 
abbreviation; in particular, disjoint cycles may be 
juxtaposed in either order, and any cycle may be begun 
with any of its elements. Thus, another representation of 
the <f> given above would be (5 3)(2 1 4). As a further 
abbreviation, in writing the cycles representing a given 
automorphism, we shall agree to omit any cycles 
consisting of a single element; such cycles correspond to 
roots which are left fixed by the automorphism. For 
example, the automorphism represented by (5 3 2) maps r 5 
to r 3 , r 3 to r 2 , r 2 to r 5 and it leaves r, and r 4 each 
unchanged. A cycle containing exactly k elements is 
called a k-cycle. A 2-cycle is called a transposition. 

With the obvious meaning, we shall use the equality 
sign between the name of an automorphism and its 
representation. Thus, we could write <#> = (5 3 2) to denote 
the automorphism described by this 3-cycle. With this 
kind of identification in mind, we shall say that a group 
contains a k-cycle if there is an element of the group equal 
to some k-cycle (as opposed to an element of the group 
simply containing a k-cycle). 

The notation just introduced is convenient for com- 
puting the product of automorphisms. For example, in 
the case n = 5, suppose <#> = (1 3 4 2) and ip = (2 5)(4 3 1); 
we can compute 

# = [(1 3 4 2)][(2 5)(4 3 1)] 

= (125) 

by tracing the effect on each number through the relevant 
cycles as these are encountered from right to left. That is. 
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one might proceed as follows: Pick any number, say 1; by 
the first cycle on the right it is mapped to 4; moving now 
to the left until the next occurrence of 4, we see that 4 is 
mapped to 2. Thus 1 is ultimately mapped to 2 by the 
product. Next, we need the effect of the product on 2; it is 
mapped to 5 by (2 5) and then 5 is left fixed; thus 2 is 
mapped to 5. Continuing in this way, we find that 5 is 
mapped to 1 and both 3 and 4 are left fixed. As another 
example, the reader should verify that 

# = [(2 5)(4 3 !)][(! 3 42)] 

= (2 4 5). 

It is easy to see that the inverse of any automorphism is 
simply obtained by reversing the order within each of its 
cycles. Thus, if ip = (2 5)(4 3 1), then i//~' = (5 2)(1 3 4). 

In the case of a polynomial / of degree n with all its 
roots distinct, we have seen that to each element of the 
Galois group there corresponds a permutation on the n 
roots, or equivalently, a permutation on the subscripts 1 
through n. It is customary to denote by S„ the set of all n\ 
permutations on the numbers 1 through n. If every such 
permutation occurs under some element of the group G, 
we say that G — S n or that G is the symmetric group on n 
objects. In many cases, however, not all permutations 
occur under elements of G. For example, we know 
(Problem 3 of Section 3.3) that the Galois group of x 5 - 1 
has only four elements, even though there are 5 ! = 1 20 
possible permutations of the five roots. 

Under certain circumstances, the Galois group of a 
polynomial will include all permutations of its roots. In 
particular, we have: 

Lemma 35a. Iff is an irreducible polynomial over Q, of 
prime degree p, and if f has exactly p - 2 real roots, then 
the Galois group of f is S p . 
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Proof. Let the roots of / be r,, r 2 , , r p , where r, and 
r 2 are the two complex, nonreal roots. The roots are all 
distinct since / is irreducible. First we shall show that the 
Galois group G is transitive, that is, for every pair of roots 
r, and r jy there exists some <j> E G such that <£(r,) = r-. To 
see this, let r ( - , r,- 2 , . . . , r^ denote the complete set of 
distinct images of r, under elements of G. Let P be the 
monic polynomial of degree m with roots r ( - , r, , 

That is, P(x) = ,(x - r, t ). The coefficients of P are 

simply plus or minus the elementary symmetric functions 
of m variables, evaluated at these r s. We would like to 
show that this is actually a polynomial over Q. To do this, 
note that the j th elementary symmetric function 
Oj{r i t r jjt . . . , r im ) is unchanged by every <j> E G, for 

r v • • • • r O] = a M r ‘)> <Kn 2 )’ • • • . rt'J) 

and the arguments on the right are distinct (since </> is 
one-to-one) and each is an image of r, under an element 
of G. Since there are m of them, they must simply be a 
permutation of the list r (| , r (j , . . . , r^. Therefore, since a, 
is symmetric, its value is unchanged. Consequently, for 
each j, Oj(r it , r, 3 , . . . , r im ) E Q, and so the numbers 
r it , r (j , are the roots of a polynomial of degree m 

over Q. However, this polynomial obviously divides / in 
QM; so since / is irreducible, m— p. Thus the images of r, 
exhaust the set of roots, and hence G is transitive. (So far 
we have only used the hypothesis that / is irreducible.) 

Next we show that G contains the transposition (1 2). 
Since the complex roots of a polynomial with real 
coefficients occur in complex conjugate pairs, r, and r 2 
must be complex conjugates. The automorphism of C 
defined by complex conjugation, when restricted to E, 
must interchange r, and r 2 and leave the other roots. 
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which are real, unchanged. Since this restriction is easily 
seen to map E onto E, it is an element of G. Its 
representation is (1 2). 

Now we show that G contains every transposition. To 
do this, define an equivalence relation on the set 
S — { 1, 2, . . . ,p) by i~j if and only if i = j or the 
transposition (/' j) belongs to G. This relation is obviously 
reflexive and symmetric; transitivity follows from the fact 
that if ( i j) and (j Ac) belong to G, then ( j k)(i j)(j k) = 
(/ k) G G. Let us denote by a bar the equivalence class of 
the element below it; a one-to-one correspondence 
between any two equivalence classes i and j can be 
obtained from any automorphism <j> that maps r, to r. To 
see this, let k represent an arbitrary element of S, k # /, 
and define m by the equation <j>(r k ) - r m . Then k G i if 
and only if (/' Ac) G G if and only if k)<p~ l = (y m) G G 
if and only if m G j. So under this correspondence, i is 
associated with j and each A: # / is associated with the m 
as defined above. Our equivalence relation divides S into 
disjoint equivalence classes, and now we know that each 
has the same number of elements. Since p is prime, either 
there are p equivalence classes of one element each or 
there is one equivalence class of p elements, namely S 
itself. The former case cannot hold, since both 1 and 
2 G 1. Therefore G contains all transpositions. Since every 
permutation can be obtained by a succession of 
transpositions (Problem 7, Section 1.6), G must equal all 
of S p . ■ 

At the same time, we have: 

Lemma 35b. If n > 5 and the Galois group G = S n , then 
G is not solvable. 
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Proof. First we establish the preliminary claim that if 
N is a normal subgroup of prime index p in a group H, 
then for any two elements <p,\pEH, the product 
<p~'\p- l <p\p E N. (The product <#> " 'ip - '<t>^ is called the 
commutator of <j> and i p.) If <j> E N, by normality 
ip~ , <P4 / E N, and then by closure G N also. If 

<p$N, consider the set N = {<p k n \ n E N, k E Z }. It is a 
subgroup of H , since id = <£°id G N; and (<p k n)~' = 
n~ , <j>~ k = <p~ k n E N, for some h E_N by the normality of 
N\ and (^ k n l )(4> J n 2 ) = <p k <j> j h l n 2 E N, again making use of 
normality. By Lagrange’s Theorem |lV| = ^|Af| for some 
integer q, since TV is a subgroup of N. At the same time, 
\N\ divides |//|=/>|/V|, and so q = 1 or q — p._ Since 
N ^ N, and so q ± 1. Thus q = p and N = H. 
Consequently we may write ip = <t> k n for some k E Z and 
some n E N. But then, = t(>~ l rt~ , <l>~ k <p<t> k n 

= <p- ] n~'<f>n E N, since G N by normality and 

since N is closed. 

Now suppose that G is solvable, so that there exists a 
sequence of subgroups G = H 0 d H t D H 2 d • • • D H N 
= {id}, where each H J+i is a normal subgroup of prime 
index in Hy We show that this is impossible, by showing 
that for every j, Hj contains every 3-cycle. (This yields a 
contradiction for j = N.) The proof is by induction. Since 
G = H 0 - S„, H 0 contains every 3-cycle. Assuming now 
that Hj contains every 3-cycle, we show that H J+l does 
also. Let (/ j k) be an arbitrary 3-cycle and define <f> = (mj 
i) and <// = (// k), where /, j, k, /, and m are all different 
(which is possible since n > 5); by the inductive 
hypothesis <J> and ^ G Hj. By the earlier claim, 

E Hj +l . But if we compute <f> “ - '<p\p - ( i j m)(k l i)(m j 

i)(i / k) = (/ j k), we see that the arbitrary 3-cycle (/ j k) is 
in H j+V i 
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We may now prove our main result, Abel’s Theorem: 

Theorem 35. There exist polynomials of every degree 
> 5 which are not solvable by radicals. 

Proof. In the light of Lemmas 35a and 35b, we first 
seek an irreducible polynomial of degree 5 over Q which 
has exactly three real roots. The polynomial f(x) = 2jc 5 - 
5x 4 + 5, which was discussed in Section 3.1, is one such 
polynomial. Multiplying it by x k , we obtain a polynomial 
of degree 5 + k which is also not solvable by radicals 
(because not all of its roots can be solved for in terms of 
radicals). ■ 

In the case of the polynomial 2x 5 — 5x 4 + 5, it is 
actually easy to see that none of its roots can be expressed 
in radicals. For if a single root r were to belong to a field 
K obtainable from Q by a sequence of radical extensions, 
then division of 2x 5 - 5x 4 + 5 by x - r would give a 
quartic over K, and we have already seen (Problem 4, 
Section 3.1) that this must be solvable by radicals. 
Alternatively, the same result follows from the fact that in 
the case of an irreducible polynomial, either none or all of 
its roots may be expressed in terms of radicals (Prob- 
lem 5). 

PROBLEMS 

1. Let w be a primitive fifth root of unity, and label 
the roots of x 5 - 1 as: r, = 1, r 2 = to, r 3 = to 2 , r 4 = to 3 , r 5 
= to 4 . Give an example of a permutation of these roots 
which does not correspond to an element of the Galois 
group of the polynomial. 
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2. Label the roots of x A - 2 as: r, = +\j2 (positive 
real), r 2 = i \J2 , r 3 = - A \[2 , r 4 = — i \/2 . Give an ex- 
ample of a permutation of these roots which does not 
correspond to‘ an element of the Galois group of the 
polynomial. 

3. Prove or disprove: If / £ F[x] is irreducible, then its 
Galois group over F is transitive. 

4. Find a fifth degree polynomial of the form 
x 5 + bx + c, such that it is not solvable by radicals. 

5. If / G F[jc] is irreducible and there is a root r of / 
and a sequence of radical extensions F = F 0 c F, c • • • 
C Fjy such that r G F^, show that / is solvable by radicals 
over F. (That is, if one root of an irreducible polynomial 
can be obtained by radicals, all the roots can be.) 

6. Suppose that the group of a polynomial / over a 
field F is S n . Show that / is irreducible over F. Use this 
result to show that if in addition n > 5, then no root of / 
may be expressed in radicals. 

7. Show that for every prime p , there exists a poly- 
nomial of degree p over Q which has S p for its Galois 
group. 

8. For every n, show that there exists a field F and an 
/ith degree polynomial / G F[x] such that the group of / 
over F is S„. (Hint: Review Problems 6 and 7 of Section 
3.3.) 

Section 3.7. Some Solvable Equations 

In the previous section we applied one of the two 
implications contained in Galois’ Theorem in order -to 
prove Abel’s Theorem. In particular, we produced a 
polynomial with an unsolvable group, and therefore a 
polynomial not solvable by radicals. In this section, we 
shall apply the converse implication of Galois’ Theorem, 
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namely, that a polynomial with a solvable group is 
solvable by radicals. It should be mentioned at the outset 
that there actually do exist (rather complicated) proce- 
dures for calculating the Galois groups of large classes of 
polynomials, but we shall not pursue this general topic 
here. Instead, let us simply apply Galois’ Theorem to 
several situations in which the group is known. 

First, we observe that if w is a primitive />th root of 
unity, p a prime, it can actually be calculated by a 
sequence of rational operations and the extraction of roots 
of lower order than p. To see this, recall that the Galois 
group G of the />th cyclotomic polynomial /( x) = x p ~ 1 + 
x p ~ 2 + • • • + 1 is cyclic and of order p — 1 (Problem 1 
of Section 3.3). Consequently, G is solvable and has a 
composition series whose factors />, are the prime factors 
of p - 1. (See Lemma 34g and its proof.) As in the proof 
of Lemma 34d, / can be solved by means of radicals of 
order /?,, as p t ranges over all the composition factors. 

From the observations of the previous paragraph, we 
immediately obtain a short proof of Theorem 28, which 
asserts that the regular n-gon is constructible if the odd 
factors of n are distinct Fermat primes. For, as noted at 
the outset of the original proof, it suffices to consider the 
case where n— p, a Fermat prime; and so we want to 
show that a primitive /?th root of unity is constructible. 
Well, since p - 1 is a power of 2, the />th cyclotomic 
polynomial may be solved using radicals only of order 2. 
Since these operations correspond to admissible construc- 
tions, the p\h roots of unity are constructible in this case! 

It is interesting to relate the original proof of Theorem 
28 to the one just given. For convenience, let us treat the 
case p = \1, although the ideas are completely general. 
The Galois group of the 17th cyclotomic polynomial is 
cyclic and of order 16, and so it may be represented as 
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G = {<#>, (j> 2 , ... , <J> 16 = id), where <j> is determined by the 
condition <f>(u) = u g in which g is a primitive root modulo 
17 and to is a primitive 17th root of unity. Thus a complete 
list of the roots of the 17th cyclotomic polynomial is: 

<t>(u), <j> 2 (u), <£ 3 (to), . . . , <#> ,6 (to). 

A composition series for G is G = H 0 d H 2 D H 3 
D // 4 = {<f> 16 = id}, where //, = {<j> 2 , <j> 4 , <> 6 , <f> 8 , <j> 10 , 
<!> 12 , cj> 16 }, H 2 = { <j> 4 , <j>\ <> 16 }, and H 3 = {<#, 8 , <#, 16 }. 

Under the fundamental Galois pairing, there exists a 
sequence of fields Q = F 0 c F, c F 2 C F 3 c F 4 = Q(co), 
such that each is a normal extension of degree 2 over the 
previous one (that is, a quadratic extension) and such that 
each F, is the fixed field of //,. Let us actually calculate 
the fields. We shall see that each F y is the extension of 
F y _, by one (and, in fact, all) of the y'-periods, and that 
complementary y'-periods are conjugates over F 7 _,. 

For example, consider the 1 -periods, which we can 
write here as 

tji = tj>(u ) + tf> 3 (u) + • • • + <> 15 (<d), 

TJj — </> 2 (w) + ^(w) + • • • + W)- 

Since they are both left unchanged by the elements of H t , 
they both belong to F,. By the linear independence over Q 
of the numbers to 1 ... w 16 , which form a basis for Q(w) 
over Q, tj, ^ tj 2 . As a consequence, they are not left fixed 
by some elements of H 0 , and so they are not in F 0 . In fact, 
since tj, and tj 2 are either left unchanged or mapped to 
each other by the elements of H 0 , they are a complete set 
of conjugates over F 0 . Therefore they are the roots of a 
quadratic polynomial over F 0 . By exactly the same 
reasoning, the reader should be able to carry the argument 
through to subsequent extensions. 
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PROBLEMS 

1. What is the group of an irreducible quadratic? Use 
the definition of a solvable group to show that it is a 
solvable group. Then carry through the procedure used in 
the first part of the proof of Galois’ Theorem to derive an 
actual formula for its solution, a formula that should turn 
out to be the quadratic formula. 

2. Repeat the previous problem for a cubic equation 
under the assumption that the group is S y Does your 
solution depend on the fact that the group is all of S 3 ? 
(Cf. Problem 1 of Section 3.1.) 

3. Let / be a cubic polynomial with real coefficients 
and let F be the smallest field containing the coefficients. 
If r,, r 2 , and r 3 denote the roots of /, we define the 
discriminant of / to be A 2 = IX</( r ; - r jf- Show that 
A 2 E F. Show further that if A 2 > 0 all the roots are real 
and if A 2 < 0 there is exactly one real root. What if 
A 2 = 0? 

4. If r is any root of an irreducible cubic /, over some 
field F, and if A is a square root of the discriminant, show 
that the splitting field E of /, over F, is given by F(A, r). 

*5. Let /be an irreducible cubic over Q with three real 
roots. Show that it is not possible to solve for any of its 
roots by real radicals alone. (This may be surprising.) 

6. Use Galois’ Theorem to show that every quartic is 
solvable by radicals. (Cf . Problem 4, Section 3. 1 .) 

REFERENCES AND NOTES 

Early approaches to low degree polynomial equations are discussed in 
[1, 2]. Edited translations of some of the original writings are also 
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English [4], The work of Abel and of Ruffini on the quartic equation is 
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CHAPTER 4 


POLYNOMIALS WITH SYMMETRIC 
GROUPS 


Section 4.1. Background Information 

The proof of Abel’s Theorem showed that for every n > 5 
there exists a polynomial / of degree n over Q such that at 
least one root of / cannot be expressed in radicals. A 
natural question is whether a stronger result may actually 
be true, namely: For each value of n > 5, does there exist a 
polynomial / of degree n over Q such that none of the roots 
of / can be expressed in radicals? As was pointed out just 
after the proof of Abel’s Theorem, the quintic polynomial 
used there actually has this stronger property. It follows 
from Problems 5, 6, and 7 of the same section that for all 
prime values of n > 5, there also exists such a polynomial. 
In a slightly different vein, if we drop the restriction that 
the coefficient field be Q, Problem 8 of that section shows 
that for every n > 5 there always exists some polynomial 
of degree n, none of whose roots can be obtained by a 
sequence of radical extensions of the coefficient field. 

All of these partial results were obtained by finding 
polynomials whose Galois groups were S„, for various 
values of n. Thus it is natural to attack the previous 
question by asking: For what values of n does there exist a 
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polynomial over Q whose Galois group is 5„? This question 
is quite difficult, and our solution will make use of 
techniques of both an algebraic and an analytic nature. 

In this section we shall recall certain facts from calculus 
concerning infinite series, and we shall use these to 
develop a classical theorem of algebraic geometry that is 
basic to our approach. It will also be necessary to develop 
a rather technical mean value theorem. In the next 
section, this material will be used to establish a 
fundamental result, the Irreducibility Theorem of Hilbert. 
This theorem asserts that if /(/,, t 2 , x) is an 

irreducible polynomial in n + 1 variables over Q, then 
there exist rational values for t, through t n such that the 
resulting polynomial in x is irreducible over Q. Using this 
theorem, we shall show in the last section that for every 
positive integer n, there actually does exist a polynomial 
over Q whose group is S n . If one wishes simply to assume 
Hilbert’s Theorem, it is possible to proceed directly to 
Section 4.3 without first studying Sections 4.1 and 4.2. 

It is supposed that the reader is acquainted with the 
basic theory of power series in the real domain. Let us 
recall the relevant facts. For every power series 2T-o a k (k 
there exists a value R, 0 < R < oc, called its radius of 
convergence, such that for |/| < R the series converges and 
for |f| > R it diverges. (There are various possibilities for 
\t\ = R, but these do not concern us.) In fact, for |/| < R 
the original series converges absolutely, meaning that the 
series 2 ol f * I converges. Absolute convergence at any 
point always implies convergence. Given two power series 
with respective radii of convergence /?, and R 2 , we shall 
refer to those t satisfying both |f| < R x and |/| < R 2 as 
being “within their common radius of convergence”. 

Power series in t are simply a generalization of 
polynomials in t, and they can often be manipulated in an 
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analogous fashion. In particular, within their common 
radius of convergence, power series can be added, 
subtracted, and multiplied to give new power series. They 
can also be divided as long as the denominator can be 
restricted to a radius within which it is never 0. The rules 
for such operations are analogous to those for poly- 
nomials. A computational example is given in Problem 1. 
Power series can also be differentiated term by term 
within their radii of convergence. 

An important theorem of which we shall make use is 
the Identity Theorem for Power Series, which says that if 
two power series have the same values on some interval 
(or even on any infinite set containing at least one point 
of accumulation), then their corresponding coefficients 
must all be identical. In particular, if we have a power 
series '2 l ? =0 a k t k identically equal to 0 on some interval, 
then each a k must be 0. 

Let ST-o a k tk have radius of convergence R > 0. Since 
for each individual t, |/| < R, we can multiply through by 
any number and still have a convergent series, it follows 
that for 0<|f|< R we can also multiply through by 
monomials of the form At~ J . The result is a series of the 
form 

b_ J t~ i + b_ j+i t~j +i + • • • + b 0 + b^t + b 2 t 2 + • • • . 

It is easy to see that such series can also be added, 
subtracted, and multiplied in the obvious fashion. (See 
Problem 2.) They can also be differentiated term by term. 

It is necessary to generalize slightly the context within 
which we shall work. So far we have considered series of 
the form a k {k where the coefficients a k are real. It is 
necessary for us to allow complex coefficients. If we have 
a sequence of complex coefficients a k and we write them 
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in terms of their real and imaginary parts, a k = b k + ic k , 
then let us make the definition 

I «/= 2 V‘+/S 

k = 0 A: = 0 k-0 

The minimum of the radii of convergence of the two real 
power series on the right will be taken as the definition of 
the radius of convergence of S“-o a***- With these 
conventions, all the previous statements about series carry 
through immediately to this case. Of course |a*| must then 
be interpreted as the modulus of the complex number a k . 
(Problem 3 asks for a verification of one such statement 
for this case. It might be advisable for the reader to think 
through some of the others.) 

A further generalization is possible, namely to the case 
where t is also allowed to assume complex values. All 
previous statements still hold in this context, but their 
proofs do not simply follow from the real case. The theory 
of such series needs to be constructed from the beginning, 
although in a manner analogous to the real case. For our 
work it is completely adequate to consider only real values 
of t, so we do not depend upon this theory. However, the 
reader who is comfortable with the more general case may 
be interested in considering our calculations to take place 
there, where they are both valid and formally identical. 

A complex-valued function / of a real variable t is said 
to be analytic at 0 if it can be represented by a convergent 
power series, 


/(')= I a k t k . 


*= o 


for all t satisfying |t| < R, for some R > 0. If / is defined 
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on such an interval then we may define a function 

*<“>■/(«) 

for all |«| > 1 / R. We say that g is analytic at oo if and 
only if / is analytic at 0. In this case we can express g in 
the form 

£(«)= 2 a k u~ k 

k = 0 

for |u| sufficiently large. Such a series is called a reciprocal 
power series. It is immediate from working in terms of the 
variable / = 1/m that all our operations on power series 
with positive radii of convergence are valid for reciprocal 
power series when |m| is sufficiently large. Similarly,/ is 
said to be analytic at t 0 , if the function 

g(u)=f(u+t 0 ) 

is analytic at 0. In this case we have an expansion 

OO OO 

/( 0= 2 a*w*- 2 a k(‘- ‘o) k 

k = 0 k = 0 


valid in an interval 1 1 — f 0 | < R for some R > 0. 

Leaving power series aside for the moment, there is one 
last preliminary observation that we shall need. If f(y) is a 
polynomial in y of degree n and if y 0 is any fixed value of 
y, then f(y) may be expressed as a polynomial of degree n 
in the variable ( y — j 0 ). It has the particular form 

f( 2 V y \ 

f(y) = /(/ o) + f\y 0 )(y - y 0 ) + (y - y 0 ) 2 


f (n \yo) 


( y-yo) n ■ 


+ • • • + 


n\ 
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This is a special case of Taylor’s Theorem, but its direct 
verification in the case of polynomials is very simple 
(Problem 4). 

Now we are ready to begin our development. In what 
follows it will be completely adequate to think of t as a 
real variable, although this restriction will not be made 
explicitly except where it is actually necessary for the 
validity of the calculations. 

Let /(f, x) be a polynomial in two variables over a field 
F. The set C of points (/, x) which are solutions to the 
equation 

/(/, x) = 0 

is called an affine plane curve. The nature of such sets is 
studied extensively in algebraic geometry, and we shall 
need to derive one of the fundamental classical results 
(Lemma 36b). By grouping together terms containing the 
same powers of x, we can write 

f(t> x) = a n (t)x n + a n _ ,(/>*"" '+ • • • +fl o (0» 

where each a k (t) is a polynomial in t over F and where 
a n (t) is not the zero polynomial. For each fixed value 
t = t 0 such that a n (t 0 ) ± 0, there are exactly n roots to the 
resulting polynomial in x, counted according to multiplic- 
ity. If these n roots are distinct, then t 0 is called a regular 
value of t. We shall shortly see that for all t sufficiently 
close to such a value t 0 , the n roots of the equation 
/(r, x) = 0 can be represented by n “root functions” 
x,(/)> x 2 (0> ■ • • > a„(/) which are analytic at t 0 . But first it 
is appropriate to verify that regular values always exist for 
the particular class of polynomials of interest to us. 

Given a polynomial /(?, x) over F, as above, we say that 
it is irreducible over F if it cannot be written as a product 
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f{t, x) = g(t, x)h(t, a), where g and h are also poly- 
nomials over F, each of degree at least one. Most points of 
affine plane curves corresponding to irreducible 
polynomials f{t, x) have regular values of /, as we shall 
presently see. Incidentally, there are actually two senses in 
which the word “degree” will be used in our discussion. 
The degree of a polynomial in two or more variables, as 
defined in Section 1.6, is the highest sum of the exponents 
of the variables among all nonzero terms. In contrast, the 
degree in a particular variable will be taken to mean the 
highest power of that variable found in the polynomial. 
Thus, for example, the polynomial tx 3 + t 4 x has degree 5, 
but its degree in x is 3. 

Lemma 36a. Let f(t, x) be an irreducible polynomial in 
two variables over the field F. Then all but a finite number 
of values of t are regular values. 

Proof. We exclude at the outset those finite number of 
t values which are roots of a n (t). Any other value t = t 0 
for which the g.c.d. of the polynomials in x,f(t 0 , x) and 
1x0 0’ x), is a nonzero constant must be regular. To see 
this, recall that the derivative is 0 at a multiple root, 
in which case the polynomial and its derivative have 
a common factor and hence a g.c.d. of degree greater 
than 0. 

Given any two polynomials g(t, x) and h(t, x) over F, 
we can write them in the form 

g(t,x) = b k x k + b k _ t x k -' + • • • + b 0 , 

h(t, x) = c m x m + c m _ i x m ~' + ■ • • + c 0 , 

keeping in mind that the coefficients are polynomial 
functions of t. From the calculations made in the proof of 
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the division algorithm (Theorem 11), it follows that there 
are polynomials q and r in x such that 


g = hq+ r 


and such that r has a lower degree in x than does h. The 
coefficients of q and r are specified as particular rational 
combinations of the coefficients of g and h, and so in this 
case they are quotients of polynomials in t over F, called 
rational functions of t over F. For arbitrary t we now apply 
the Euclidean algorithm to find the g.c.d. of f(t, a) and 
f x (t, a), considered as functions of x. This only involves 
repeated use of the division algorithm, and so the result 
has the general form of a polynomial in x with rational 
functions for its coefficients, say 


, , MO , , Pj- 1(0 , . 

r(t, x) = -Mr x J + — — — x J 1 


MO 


?/-i(0 


MO 
MO ' 


This divides into f(t, a) to give a quotient polynomial of 
the same type. That is, 

/(/, a) = r(t, x)s(t, x), 

where s is a polynomial in x with rational functions of t as 
its coefficients. By an argument analogous to the proof of 
Gauss’ Lemma (Lemma 12), it is easy to verify that r and 
s may be modified to new polynomials r and 5 of the same 
degree in x but with coefficients that are polynomials in t, 
so that 


f(t, x) = r(t, x)s(t, x). 


(See Problem 5.) By the irreducibility of /, r(t, x) must be 
a constant. Since r has the same degree in x, we have 


r(t, x) = 


MO 
MO ’ 
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where p 0 is not identically 0, by the stopping criterion of 
the Euclidean algorithm. As in the proof of the latter, we 
can write 

p 0 (t) 

x)+T(t,x)f x {t,x)-j±+, 

where S and T are polynomials in x with rational 
functions of t for their coefficients. Excluding the finite 
set of values of t which are roots of any of the 
denominators or of p 0 (t), we conclude that / and f x cannot 
have any other common roots. For at such points the left 
side would be zero and the right side would not. Thus for 
all but a finite set of values, each value t = t 0 causes the 
g.c.d. of f(t 0 , x) and f x (t 0 , x) to be a nonzero constant. As 
noted at the outset, this completes the proof. i 

Careful scrutiny of the proof of the previous lemma 
shows that it still applies if the hypothesis on /(/, x) is 
weakened somewhat. In particular, it is sufficient to 
assume that /(/, x) is irreducible in x, meaning that we 
cannot write /(/, x) = g(t, x)h(t, x), with g and h each 
polynomials of degree at least one in x. For example, the 
polynomial over Q given by /(/, x) = tx 2 + t is irreducible 
in x even though it is not irreducible in the general sense. 
Several of our subsequent results may also be given with 
similar weakened hypotheses, but this would add 
unnecessary complexity both to their statements and their 
proofs. Since this generalization is not relevant to the 
application we have in mind, it will not be pursued 
further. 

It is also appropriate to remark at this point that the 
similarity in the arithmetic of polynomials in x with 
coefficients in a field F and polynomials in x with 
coefficients that are rational functions in t suggests the 
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possibility of extending the definition of field so as to 
include both cases. For example, the set of rational 
functions in t over F, say, is closed under addition, 
subtraction, multiplication, and division (except by 0), 
and thus it has the same basic arithmetic structure as the 
sets of numbers we have called fields. This generalization 
of the notion of a field is very powerful and valuable, and 
it represents the point of view one would encounter in the 
subject of abstract algebra. For our purposes it is more 
convenient to deal with the concrete notion of a field of 
complex numbers, and so the abstract point of view will 
not be developed here. 

Next we see how the points of an affine plane curve C 
can be represented in the neighborhood of a regular value 
of t. 

Lemma 36b. Let t 0 be a regular value of the affine plane 
curve given by f(t, x) = 0. Then there exist n functions 
xft), x 2 (t), . . . , x„(t) which are analytic at t 0 and such that 
for every t in some neighborhood of t 0 , they represent the 
complete set of solutions to f(t, x) = 0. 

Proof. Without loss of generality we may take t 0 - 0, 
for the transformation g(t, x) = f(t + t 0 , x) may be used 
to put the problem in this form. Since 0 is a regular point, 
there are n distinct roots, which we may denote by 
jc i(0), x 2 (0), . . . , x n (0). We shall show that each of these 
gives rise to an analytic function xft) as described in the 
theorem. 

Let x(t) denote any one of the sought root functions 
Xj(t). Without loss of generality we may also assume that 
x(0) = 0, for the transformation g(l, x) = f(t, x + x(0)) 
may be used to obtain this form. Consequently, we now 
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seek a power series 

x (‘)= 2 b kt k 
k = I 

which converges on some set j/| < R, R > 0, within which 

/('.*(')) = o- 

Since by our assumptions /( 0, 0) = 0, we may write / in 
the form 


f(t, x ) = a, 0 t + a 0 t x + 2 a i/ xJ - 

i+j>2 

The sum on the right is a finite sum because / is a 
polynomial. Since t = 0 is a regular value, f x ( 0, 0) = a 0 , 
^ 0, for otherwise x = 0 would be a multiple root. 
Consequently we may divide / by — a 0 , without changing 
the solutions. Equivalently, without loss of generality we 
may take a 0 ! = - 1. 

Our proof has two main parts. First we determine a 
unique set of coefficients b k , k > 1, in order for x(t) to be 
a candidate for a solution to f(t, x(t)) = 0. Then we show 
that for this choice of the b k ' s, the resulting series actually 
has a positive radius of convergence and does satisfy the 
equation. 

From the equation 

f(t, jc) = a, 0 t - x + 2 a ,j‘' xJ 

i+j> 2 

it follows that 

/*('>*)= _1 + gi 1 ’*)’ 

where each term of g has degree at least one. By 
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substituting a b k t k into the equation f(t, x ) = 0, it 

follows that we must have 6, = a t 0 . For the only terms 
involving t raised to the first power are a, Q t and - b x t\ by 
the Identity Theorem for Power Series they must add to 0. 
Now we obtain a recursive definition for the b k s, k > 2, 
by taking a Taylor expansion of the polynomial in 
y,f(t,y), around the point y 0 = For the value 

y — 2,°1 \b,t‘ we obtain 



i + *K2V 

«•- ] 


2 V' 

i = k 


+ terms of degree at least 2k. 


By the Identity Theorem for Power Series, the coefficient 
of each power of t must be 0. In particular, the coefficient 
of t k is - b k plus the coefficient of I* in the expansion of 
the polynomial /(/, Therefore, b k must equal 

this coefficient. With this rule and the initial value 
b x = a, 0 , the entire sequence is determined. 

Let us make two further observations from this 
calculation. First, if the resulting series 2“-i t>kt k does 
have a positive radius of convergence, then 0 
= /(/, 2?- \b k t k ), for this latter is a power series every 
one of whose coefficients is 0. Second, it is easy to see 
inductively that each b k can be written in the abbreviated 
form p k (a,p, where p k is a polynomial in several variables, 
with positive integral coefficients, evaluated at the original 
coefficients a y. 
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Now we proceed to show that 2*= A** has a positive 
radius of convergence. We shall do this by constructing a 
power series that has a positive radius of 

convergence R and such that for each k, A k > \b k \. Thus 
for |/| < R, we will have 

2 IV*I < 2 \ A k * k I < oo. 

*=1 k = 1 

from which follows the convergence of 
Let A be any number greater than or equal to every 
single | flU. Following our previous calculation, but with 
each Uy replaced by A in the original equation, we see that 
if this new equation has an analytic solution at 0, the 
solution must have the form 2“-i A k t k with A k = p k (A) 
for each k. Thus if we can demonstrate independently that 
this equation does have an analytic solution at 0, the 
argument will be complete. The equation is 

0 — At - x + A ^ t‘x J 

i +j > 2 

and it is actually quite easy to solve. Using the formula 
for the sum of a geometric progression, we obtain 

00 00 00 / 00 

0 = At — x + At 0 2 At x ^ a 7 + A ^ f'l 2 
j - 2 j - i 1-2 \y=o 

-At- x + A -r— — At — — — — 

1 — A 1 — X 


+/l (Tb)(-nb)' 
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Manipulating algebraically, we have 


0 = At — Atx — x + x 2 + Ax 2 + Atx + A 



= [A + l]x 2 + [ - 1 ]x + 


A 


which is a quadratic equation in x. The solution satisfying 
x(0) = 0 is given by 


1 -V 1 -4 (A + l)v4//(l - t) 

x= 2(7TT) ■ 

The radic al may b e expressed as a quotient of functions of 
the form \j\ - Kt . From calculus (see also Problem 6) we 
know that Vl - u may be expressed in a p ower se ries in u 
for |u| < 1, from which it is clear that Vl - A ~t is also 
analytic at 0. By the validity of the arithmetic operations 
on power series, we conclude that x is analytic at 0. 

We have thus shown that to each x,(0) there 
corresponds an analytic root function x,(f) = x,(0) + 
} k • % ^eir continuity at 0 these functions are all 
distinct in some neighborhood of t = 0 and thus they 
represent the complete set of solutions in such a 
neighborhood. i 

The last preliminary result that we shall need is a fairly 
simple observation from interpolation theory, although its 
analytic statement appears somewhat complicated. Given 
m + 1 increasing values of the real variable t, t 0 
< ■ ■ ■ < t m , let us denote by V m the Vandermonde 
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determinant 

1 t t 1 ,m-\ ,m 

1 '0 ‘o • • • ‘0 ‘0 

i i, tf ... i, m “ i /, m 


For a given function z(t), we denote by W m the 
determinant obtained by changing the last column of V m 
as follows: 

1 / 0 tl ... t™ z(f 0 ) 

1 /, tf ... *('i) 

1 t m tl ... C" 1 

With this notation, we may state the lemma: 

Lemma 36c. If z(t) is m times differentiable on the 
interval t 0 < t < t m , then there exists some value 7, 
t 0 <7 < t m , such that 

z (m) (7) = W m 

m\ V m • 

Proof. First we prove the more general result that if 
/(/) is any m times differentiable function on the same 
interval, such thaty(/,) = z ( t : ) for every i, 0 < /' < m, then 
y (m \t) = z (m) (i) for some 7, t 0 < 7 < t m . For the Mean 
Value Theorem, applied to the function y — z on each 





196 


POLYNOMIALS WITH SYMMETRIC GROUPS Ch. 4 


interval [r,, t j+ ,], implies thaty (l) (0 and z (l) (r) agree on m 
distinct points. By the same argument, y (2> (0 and z (2 \t) 
agree on m — 1 points. Continuing in this fashion we 
obtain the desired conclusion. 

The lemma follows upon taking fory(t) the polynomial 
of degree m which interpolates to the values of z(t) at 
each In particular, to construct such a polynomial 
y(t) = a 0 + a t t + ■ ■ ■ + a m t m we simply need to solve 
the following linear system for the a- s: 

a 0 + a x t 0 + • • • + a m _ { tg ~ 1 + a m tg = z(t 0 ) 

a o + a l t ] + • • • + u m _ 1 t 1 m_l + a m t™ = z(/,) 


a o+ a i l m + 

By Cramer’s rule, 


+ a m-lC ' + a m l n 


*0m)' 


K, 

y m ' 


( V m is not 0 since V m — — tj) and the t’ s are 

distinct.) Since y (m) (t) has the constant value m\a m , the 
equation z <m) (7) = y (m) (7) yields the statement of the 
lemma. ■ 


PROBLEMS 

1. The function e‘ is sometimes defined by the power 

series t n /n\. Show that this series converges for all 

t. Then use multiplication of series to prove the law of 
exponents e a e b = e a+b . 

2. On the basis of the fact that power series can be 
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added term by term within their common radius of 
convergence, show that two series of the form f(t) = 
2*°- -/**** an d g(t) = 2“- -jbk t>c can a l so be added term 
by term. Here j > 0 and / and g both converge when 
0 < |r| < R. 

3. On the basis of the definition of series of the form 

i n the case when the a k ’s may not be real, 
justify the fact that two such series may be multiplied in 
the usual way within their common radius of convergence. 

4. Prove Taylor’s Theorem for the case of a poly- 
nomial: If / is a polynomial of degree n in y, then 

fO-)/ y \ 

f(y) = f(yo ) + f l \y 0 )(y ~ y 0 ) + — 2! - °- (y ~ y <>) 2 


+ 


+ 


f n \y 0 ) 

n\ 


(y-yo) n - 


Herey 0 is an arbitrary fixed value of y. 

5. Prove the following version of Gauss’ Lemma: If 
f(t, x ) is a polynomial in two variables over F and if / can 
be written as a product /(f, *) = r(t, x)s(t, x) where r and 
s are polynomials in x whose coefficients are rational 
functions of t over F, then in fact / may be written 
/(/, x) = r(t, x)s(t, x) where r and 5 are polynomials over 
F in two variables. Furthermore, r and r have the same 
degree in x, as do s and s. 

6. Prove that the function — t is analytic at 0. 

*1. Let f(t) be analytic at 0 and suppose /( 0) > 0. 

Show that / has an analytic square root at 0; that is, show 
that there is a function g(t ) defined in some neighborhood 
of 0 such that g(t) is analytic at 0 and [ g(t)f - f(t). 

8. Consider the algebraic curve C defined by 
/(?, x) = 0. A real point of C is a point where both t and x 
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are real. For example, the real points of t 2 - x 2 = 0 form 
two lines in the real t, jc-plane. The curve t 2 + x 2 = 0 has 
only (0, 0) for a real point. A real point is isolated if it has 
some neighborhood containing no other real points, as in 
the second example. Show that if /(/, x) has real 
coefficients, then for every regular value of t, any corre- 
sponding real point cannot be isolated. 

9. Suppose that y — g(t) is a polynomial of degree m 
over C. If there exist m + 1 values t i E Q at which 
g(0 e Q, show that the coefficients of g(t) must actually 
be in Q. 


Section 4.2. Hilbert’s Irreducibility Theorem 

The theorem to be proved in this section may be regarded 
as the fundamental tool in our treatment of polynomials 
over Q with symmetric groups. We shall first prove a 
special case of the theorem as Lemma 36d, and by using 
this it will not be difficult to obtain the general result. 

Suppose that f(t, x) is an irreducible polynomial over Q 
in two variables. It is useful to observe that / may be 
transformed into another polynomial over Q that has 
certain convenient properties. Let f(t, a) be written in the 
form 

f(t, x) = a„(t)x n + a„_ ,(/)•*”' ' + • • • + a 0 (t), 

where each a t (t) is a polynomial over Q, and suppose d is 
the highest degree of any of these polynomials a,(f). Since 
/ is irreducible, all but a finite number of t values are 
regular, and so we may pick a regular value t 0 which is 
rational. The function g(t, x) defined by 

g{t, x) = t d f(t 0 + y , a) 
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is readily seen to be a polynomial over Q. It has three 
important properties. First, it has n distinct root functions 
that are analytic at infinity; we thus say that g has a 
regular value at infinity. To see that this is the case, note 
that for values of the variable t 0 + (\/t) sufficiently close 
to f 0 , the solutions to the equation f(t 0 + (1/r), x) = 0 may 
be expressed as power series in the variable t 0 + (l/t)- 
t 0 , which is simply 1 / 1 . Second, g(t, x) is also irreducible. 
For if we could write 

g(t, x) = r(t, x)s(t, x) 

with r and s each of degree at least one, we could also 
write 


/(', *)-(<- ‘o) d g[ J~ . 

Since the sum of the highest degrees of the coefficient 
polynomials in r and j must equal d, it follows that the 
above expression would imply a nontrivial factorization of 
f(t, x), contrary to our hypothesis. Third, if t, + t 0 is a 
rational value at which the polynomial g(/„ x) is 
irreducible in Q[x], then the corresponding rational value 
/, = t 0 + (V , i) * s suc ^ that /(/,, x) is also irreducible in 
Q[x], This follows from the above equations upon setting 
t = t x . 

We are now ready to prove Hilbert’s theorem for the 
case of two variables: 

Lemma 36d. Let f(t, x) be an irreducible polynomial in 
two variables over Q. Then there exist an infinite number of 
rational values t 0 such that f(t 0 , x) is irreducible in Q[x]. 
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Furthermore, if fft,x),...,f M {t,x) are M such poly- 
nomials, then there are an infinite number of rational values 
t 0 such that fft 0 , x), . . . , f M ( t 0 , x) are all irreducible in 

Q14 

Proof. As usual, we write f(t, x) in the form 

f(t, x) = a n (t)x n + a„_ x (t)x n -' + • • • + a 0 (t). 

On the basis of the comments preceding the lemma, we 
can assume without loss of generality that / has a regular 
value at infinity. Thus for all t greater than or equal to 
some T 0 , there are n distinct root functions x,(t), x 2 (t), 
. . . , x„(?) and each may be expressed as a reciprocal 
power series. For t > T 0 , we also know a n (t) ^ 0. For 
each fixed t, f(t, x) can certainly be factored in C[x] as a 
product of linear factors; in particular we can write 

f(t, x) = a n (t)fl[x - Xi (t)] 

/ = i 

for all t > T 0 and all x. Because f(t, x) is irreducible over 
Q, the product XI"=i[-* - *,(0] cannot be written as 
a product of nontrivial polynomials over Q. Conse- 
quently, if S is any nonempty proper subset of the set 
{1, 2 then the factorization 

n[x -*,.(/)] = n o -*,(')]• no-*,.(o] 

1=1 /ss i$S 

does not represent a factorization into polynomials in two 
variables over Q. Each of the factors is a polynomial in x, 
of course, and so at least one of the coefficients is not a 
polynomial in t over Q. But even further, one of the 
coefficients is not even a rational function in / over Q, 
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because if they all were rational functions, our recent 
version of Gauss’ Lemma (Problem 5, Section 4.1) would 
yield a nontrivial factorization of /(/, x). There are 2" — 2 
such sets 5 and thus N = 2"~ l - 1 distinct factorizations 
of the above type. For each such factorization, we choose 
one coefficient function that is not a rational function of t 
over Q. In this way we obtain a list of functions y i(0> 
y 2 (t), ■ ■ ■ >/n( 0- By converse reasoning, if t 0 > T 0 is a 
rational value of t such that each of the numbers y x (t^, 
j 2 (fo)> ■ ■ ■ >y N (t 0 ) is irrational, then f(t 0 , x) is irreducible 
in Q[jc], To see this note that a n (t 0 ) is a nonzero rational, 
and thus a factorization of /(/ 0 , x) in Q[jc] would imply a 
factorization of H"_ ,[jc - x: ( (/ 0 )] in Q[xj. But without loss 
of generality we need only consider factors of the form 
Rest* ~ ■*,(')] and n,^ s [x — jc,(/)], and these possi- 
bilities are all excluded by the choice of t 0 . 

Thus the first part of the lemma will be proved if we 
can find an infinite number of rational values t 0 > T 0 
such that all the numbers j,(/ 0 ) are simultaneously 
irrational. We shall actually manage to do this with 
integers t 0 . Let us begin by taking an arbitrary /,(0> 
henceforth denoted y(t), and studying how many integers 
t 0 > T have the property that/(t 0 ) is rational. In a certain 
sense this set is small. 

The function y(t) is obtained from some subset of the 
root functions x,(/) by no more than addition, subtraction, 
and multiplication. From this we deduce two things: y(t) 
may be represented by a reciprocal power series for 
t> T 0 , and for each such t that is rational, y(t) is 
algebraic over Q. This latter fact follows from Theorem 8. 
By reviewing the proof of this same theorem, we further 
see that we can construct a polynomial in y with 
coefficients which are rational functions of t over Q, such 
that for each /, y(t) is a root. Thus we have that/(0 is a 
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root of an equation of the form 

dj{t)y j + dj _ ,(/)/"' + • • • + d 0 (t) = 0. 

Without loss of generality we may take the coefficient 
functions to be polynomials, and then with integral 
coefficients, for multiplication through by a suitable poly- 
nomial over Q would convert rational d’ s to this form. 
Now if we multiply this equation through by [dXt)Y~ 1 and 
make a new function z(t) = dj(t)y(t), we have z(t) as a 
solution to a polynomial equation 

z J + bj_ ,(/)z 7_ 1 + • • • + b 0 (t) = 0 

where the b? s are polynomials in t with integral coef- 
ficients. There is a real advantage to working with the 
function z(t) rather than y(t). If t 0 is an integer for which 
j>(/ 0 ) is rational, then z(t 0 ) has to be an integer. To see 
this, first note that z(t Q ) is rational and each is 

integral. By the Rational Roots Theorem (Problem 2, 
Section 1.5), the only possible rational values of z(t 0 ) are 
integers. 

Thus we now want to show that the set of integers 
t Q > T 0 for which z(? 0 ) is an integer is ‘small’. From the 
definition of z(t ) as the product of a polynomial and a 
reciprocal power series, we can write it in the form 

z(/) = c*f*+ • • • + c I / + c 0 + c_,/ _1 + • • • . 

If z is actually a polynomial, then one of its coefficients 
must be irrational, for otherwise y(t) = z(t)/d(t) would 
be a rational function over Q, contrary to our hypothesis. 
But if z is a polynomial with an irrational coefficient, then 
by Problem 9 of Section 4.1 there can only be a finite 
number of integers t 0 for which z(t 0 ) is an integer. In this 



Sec. 4.2 HILBERT’S IRREDUCIBILITY THEOREM 


203 


case we can pick a T x > T 0 such that for all integers 
t 0 > T u z(t 0 ) is irrational. 

If any coefficient c, is not real, then it is also the case 
that there can be only a finite number of integers i 0 > T 0 
for which z(t 0 ) is an integer. To see this, let i be the largest 
subscript of such a c,. Then lim,^ lm{z(/)/t'} = Im c,, 
and so for all t greater than or equal to some T\ > T 0 , z(t) 
is not even real. 

The final case then is where all the c/s are real and 
where at least one c, with a negative subscript is not 0. In 
this case, by differentiating z(t) a sufficient number of 
times we can eliminate all nonnegative powers of t. The 
result can be written 



where p is a nonzero constant, q is a positive integer 
(actually > m + 1), and the dots represent terms with 
higher powers of \/t. Since lim ,^ x t q z ( - m \t) - p, there 
exists a T { > T 0 such that for all t> 7’,, 0 < \z (m \t)\ 
<2\p\/t“. 

It is now time to make use of Lemma 36c. If there are 
an infinite number of integers t > T ] such that z(t) is also 
an integer, let us determine how much they must be 
spread out. Let t 0 < t x < • • • < t m be m + 1 such 
integers. Using the notation of Lemma 36c, we see that 
W m is a nonzero integer and so | W m \ > 1. Thus we have 

M > 2 \p\ ^ < z(m) ( 7 )l , i 

m\ tft ' m\ t q ' ml |F m | 

Therefore we also have 


ml 

2 \P\ 


>> J 
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This implies that there are positive constants a and ft such 
that t m — t 0 > at$. Since the right side increases with / 0 , 
we see that for larger values of t, these t’s become more 
spread out. In particular, choose a number T 2 > T, such 
that aT% > Nm. (Recall that N is the number of functions 
K(O-) Then any string of Nm + 1 consecutive integers 
greater than or equal to T 2 contains at most m integers t, 
for which y(t) is rational. 

Now we shall demonstrate the existence of integers t 
such that for all j, is irrational. For those y's which 
only have a finite number of integral values t such that 
yAt) is rational, choose a number T 2 larger than all of 
these values. For the other y's, pick a value of m that will 
simultaneously serve in all cases. The result of the above 
computation yields a value T 2 . Let T* be the maximum of 
all these T 2 s. Then any string of Nm + 1 consecutive 
integers greater than T * contains at most m integers t i 
yielding rational values for any one function Since 
there are N functions, there is at least one point in each 
such set at which the functions are simultaneously 
irrational. This then completes the proof for a single 
function /. In the case of M functions /, we simply work 
with the union of the y,’s corresponding to each one. 
Letting the total number now be represented by N, the 
above argument applies and gives the desired conclu- 
sion. ■ 

We have proved Hilbert’s Irreducibility Theorem for 
the case of a polynomial in two variables, and we want to 
develop the analog for polynomials in n + 1 variables. To 
this end we shall use a transformation, called Kronecker’s 
specialization, for converting a polynomial in n + 1 
variables to a polynomial in two variables such that the 
relevant irreducibility properties are essentially preserved. 
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Let /(w 0 , tq, . . . , m„) be a polynomial over Q in n + 1 
variables. In what follows, let us think of it as a poly- 
nomial in the n variables u„ u 2 , . . . , u n which has 
coefficients that are elements of Q[« 0 ], Each term consists 
of some polynomial in u 0 over Q multiplied by an 
expression of the form u['u 2 2 ■ ■ ■ u‘ n \ Let d be a number 
such that every one of the exponents /,, for every 

one of the terms is less than d. Fixing this number d, let P d 
be the set of all polynomials over Q in u 0 , u n 
having this same property. Then for any polynomial 
g(u 0 , u n ) in P d we define a new polynomial 
g(u 0 , y) by the equation 

= g(«» >y>y d >y d \ • • • >y d " ')- 

Thus g is a polynomial in two variables u 0 and y. 
Considered as a polynomial in y with coefficients which 
are elements of Q[w 0 ], the degree of g in^ is less than or 
equal to d n - 1. To see this note that under our 
correspondence a monomial u\'u' 2 2 • • • u' n " is associated 
with a term j' l+d ' 2+ ‘ /2 ' 3+ " d " V The exponent of y has its 
maximum value d n - 1 when each ij equals d - 1. The set 
of all polynomials over Q in u 0 and y with degree at most 
d" — 1 in/ will be denoted K d . 

If we replace u 0 by a rational constant « in the original 
polynomial g, and if we define the transform of the 
resulting polynomial in the obvious way to be g(a, y, y d , 
y dl , . . . ,y d " '), it is clear that this is the same polynomial 
in y that would be obtained by first transforming g and 
then substituting u 0 = a. This polynomial is of course 
g(a,y). The properties to be explained below hold both 
for the case when u 0 is a variable and when u 0 has been 
given a specific rational value a, and even the notation P d 
and K d will be understood to refer to both cases. 
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The mapping from P d to K d given by is easily 

seen to be a one-to-one correspondence; that is, it is both 
one-to-one and onto (Problem 1). Furthermore, if a poly- 
nomial / G P d can be factored over Q, so that we can 
write / = gh for g, then f can also be factored: 

f=(g h ) = gh- 

(Problem 2.) Each of g and h has degree at least one in 
some one of the variables m,, m 2 , . . . , u n if and only if g 
and h each have degree at least one in y. 

The converse statements do not hold: f may be 
reducible even if / is not. For example, suppose f may be 
written 

/= GH 

for G, H e K d . It is certainly true that there exist 
g, h E P d such that g-G and h = H. However, it is 
possible that the product gh may not be in P* in which 
case we are not led to a factorization of /. For example, 
consider the polynomial /(«,, u 2 ) = u 2 + uj, which for 
simplicity does not even contain the variable u 0 . With the 
value d = 3, we obtain the transform f(y) = y 2 + (y 3 ) 2 
= y 2 4- y 6 . This is obviously reducible over Q, whereas the 
original polynomial /(«,, w 2 ) is not. To see what happens 
when we factor / into a product GH and then construct 
the corresponding g and h in suppose we look at the 
factorization / = GH with G(y) - y and H(y)=y+y 5 . 
By the definition of our transformation, it is easy to see 
that g = G for g(u ,, w 2 ) = h,, and h = H for h(u y , u 2 ) 
= «[ + ufu 2 . Now g and h are both in P d> but their 
product 


(gh)(u v « 2 )= u 2 { + u]u 2 
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is not, since the exponent of h, in the second term is not 
less than d. Thus we cannot have the factorization/ = gh. 
This same phenomenon occurs with the other factoriza- 
tion of /, namely, f(y) = y 2 0 +T 4 )- (See Problem 3.) 
These observations lead us to formulate the following 
precise relationship between the reducibility of / and /, 
called Kronecker’s criterion: 

Lemma 36e. In the terminology of the preceding 
discussion , f is irreducible as a polynomial in u 2 , ... ,u n 
if and only if every nontrivial factorization of its transform, 
f = GH, leads back to a product gh which is not in P d , that 
is, which contains some one of the variables w,, u 2 , . . . , u n 
raised to a power greater than or equal to d. 

_ Proof. If / is reducible, say / = gh, then as noted above 
/ = gh, which implies that / is reducible. On the other 
hand, if / is irreducible, then any factorization f = GH 
= gh must necessitate that gh $ P d , for if gh the fact 
that the transform is one-to-one would imply that / = gh, 
contradicting the irreducibility of /. i 

Let us now use these ideas to complete the proof of the 

Hilbert Irreducibility Theorem: 

Theorem 36. Let f(t x , t 2 , . . . , t n , x ) be an irreducible 
polynomial over Q in the n + 1 variables t 2 , . . . , t n , x. 
Then there exist an infinite number of sets of rational values 
a p a 2 , . . . , a n such that /(«,, a 2 , . . . , a n , x) is irreducible 

in Q[x]. 

Proof. The proof will be by induction on n. When 
n = 1, the theorem reduces to Lemma 36d, the case of 
Hilbert’s theorem for two variables. The induction step 
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consists of reducing the case of n to the case of n - 1 by 
showing that there are an infinite number of rational 
values a such that /(a, t 2 , ■ ■ ■ , t n , x) is irreducible over Q. 
For this step it is convenient to relabel the variables as 
« 0 , and to adopt the notation of the previous 

discussion. 

Thus we have that f(u 0 , u n ) is irreducible. If it 

happens that f(u 0 ,y) is irreducible over Q, then Lemma 
36d implies the existence of an infinite number of rational 
values a such that f(a,y) is irreducible in Q[/]. By 
Lemma 36e, for each such value a the polynomial 
/(a, u n ) is irreducible, which is what we want. 

If /(m 0 , y) is reducible, then a more complicated analysis 
is necessary. In particular, by repeatedly factoring over Q 
until this is no longer possible, we can write 

f(u 0 ,y) = nc,(K 0 ,;) 

i 

where each G, is irreducible over Q. By Lemma 36d, there 
is an infinite set & of rational values a for which all the 
polynomials G,(a,y) are simultaneously irreducible in 
Q[y]. We claim that for all but a finite number of these 
values a, the polynomial /(a, u„) is irreducible as 

a polynomial in n variables. The crucial tool will be 
Lemma 36e. 

There are only a finite number of essentially different 
ways in which /(a, y) might be factorable over Q into two 
nontrivial factors. To see this, suppose we have a 
factorization 

f (a, y) = A (y)B(y), 

where A and B are polynomials in Q[ y], Since we also 
have 

/(a,T) = nG,(a,y), 

i 
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and since each G,(a, y) is irreducible, some of the (7,’s are 
factors of A(y) and the others are factors of B(y) 
(Theorem 14). That is, we can write 

A(y) = aUG,(a,y), 

i£S 

B(y) = a~' II G,(a,y), 
ie t 

for some rational number a and some disjoint sets S and 
T whose union contains all the subscripts i. Without loss 
of generality we may take a = 1 by dividing the original 
A(y) by a and multiplying the original B(y) by a. 
Consider the corresponding factorization of f(u 0 ,y): 

f (“o >T) = ( Ft C,(u 0 ,y))( Ft G/K,/)) 

and call the two factors in parentheses on the right 
Q( u 0 ,y) and R(u 0 ,y) respectively. Since both Q and R 
are in K d , there must exist polynomials q(u 0 , u n ) 

and r(u 0 , n,, . . . , u n ) in P d such that q= Q and r — R. 
Note that Q(a,y) = A(y) and R(a,y) = B(y). 

Since /(w 0 , u n ) is irreducible, the product q(u 0 , 

u n )r(u 0 , u n ) cannot be in P d , by Lemma 

36e, and so some term has too high a power of at least one 
of the variables h, through u„. The coefficient of this term 
is a polynomial in u 0 , and so long as a is not one of the 
roots of this polynomial, the product q(a, u n ) 

r(a, u„ . . . , u n ) also will not be in P d . If now for each of 
the finite number of groupings of the subscripts into sets S 
and T we remove from the set & that finite set of a values 
just described, then all the remaining values of a will 
guarantee, by Lemma 36e, that /(a, w,, . . . , u n ) is 
irreducible. As noted earlier, this then completes the 
proof. i 
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PROBLEMS 

1. Verify that Kronecker’s specialization gives a 
one-to-one correspondence between the sets P d and K d . 

2. If g, h, and gh are all in *v. show that (gh) = gh. 

3. In the example immediately preceding Lemma 36e, 
verify that the factorization f(y ) = y\ i + y 4 ) leads back 
to a product gh which is not in P d . 

4. This problem refers to the proof of Theorem 36. 
Define g,(u 0 , u„ , u n ) by the requirement that gj(u 0 ,y) 
= Gj(u 0 ,y). Does it follow that q(u 0 , u n ) = 

ILesS,(«o> «!.•••> M„)? 

*5. Let f(t, x) be an irreducible polynomial in two 
variables over Q, and let DC be the set of all rational 
numbers a such that f(a, x) is irreducible as a polynomial 
over Q in x. DC is called a basic Hilbert set. Lemma 36d 
shows that DC is infinite. Prove that DC is dense in the real 
line R. (A set S c R is said to be dense in R if for every 
r 6 R there exists a sequence of numbers s n G S 
converging to r.) 


Section 4.3. Existence of Polynomials over Q with 
Group S„ 

In this section we shall see that for every positive integer n 
there exists a polynomial over Q whose Galois group is 
S„. As noted earlier, this implies that for every n > 5, 
there is a polynomial over Q none of whose roots can be 
expressed in radicals. Another implication will be to show 
that for every m > 2, there exist numbers a such that 
deggct = 2 m but a is not constructible. Although a number 
of the preliminary results we need have been anticipated 
by problems in Chapter 3, they will be repeated here so 
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that the entire argument may be more easily compre- 
hended. 

Let F be a field. We say that the n complex numbers a,, 
a 2 , . . . , a n are algebraically independent over F if there is 
no nontrivial polynomial p(x x , x 2 , . . . , x n ) in n variables 
over F such that p(a x , a 2 , . ■ . , a n ) = 0. In other words, 
there is no algebraic relation among the numbers a x , 
& 2 ’ • • • > a „ . 

Lemma 37a. If F is a countable field, then for every n 
there exist n algebraically independent elements over F. 

Proof. Since F is countable, the set of numbers 
algebraic over F is also countable (Problem 4, Section 
1.5). Since C is uncountable, we can find a number a, 
which is transcendental over F. The extension F(a,) is 
simply the set {f(a x )/g(a x ) | f,gE F[x], g * 0}, and it is 
an elementary countability exercise to show that this set is 
countable. Thus we can apply the same argument to find 
an element a 2 which is transcendental over F(a,). 
Continuing in this fashion we can generate a sequence of 
numbers a x , a 2 , ■ ■ ■ , a n such that a n is transcendental over 
F(a,, a 2 , . . . , a n _ x ). To show that these numbers are 
algebraically independent, let us suppose to the contrary 
that there is a nontrivial polynomial relation among them: 
p(a x , a 2 , . . . , a n ) = 0. If k is the largest subscript such 
that a k actually appears in this relation, then we could 
conclude that a k is algebraic over F (a,, a 2 , . . . , a k _ x ), a 
contradiction. i 

Next we see how permutations of algebraically 
independent elements give rise to field automorphisms. 

Lemma 37b. Let a x , a 2 , . . . , a„ be algebraically inde- 
pendent over a field F, and let E = F(a„ a 2 , . . . , a n ). Then 
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each of the n \ permutations of the list a v a 2 , . . . , a n induces 
an automorphism on E which leaves F fixed. 

Proof. Any permutation can be accomplished by a 
sequence of transpositions (Problem 8, Section 1.6). Thus 
it suffices to show that every transposition induces an 
automorphism, for the composition of automorphisms is 
an automorphism. Moreover, it is apparent that the 
sequence of simple extensions leading from F to E by each 
of the elements a,, a 2 , . . . , a„ may be taken in any order. 
Therefore, in treating the transposition which in- 
terchanges a t and aj, We may for convenience define K as 
the extension of F by all the other a’ s, and then write 
E = K(a„ a) = K(c, d), where we have set c = a„ d = aj. 

The set E may be represented as E = { p(c, d)/q(c, d) | 
p, q polynomials over K in two variables, q # 0}. The 
natural candidate for the automorphism <j> induced by the 
transposition which interchanges c and d is the mapping 

P(c,d) p(d, c) 

<?( c ’ d ) ?( d ’ c ) ' 

It is necessary to show that this mapping is well defined, 
one-to-one and onto, that it preserves addition and 
multiplication, and that it leaves elements of F 
unchanged. 

To show that </> is well defined, we must consider the 
possibility that an element of E has two different 
representations, say 

P\{c, d ) = Pi( c, d) 
q x {c,d) q 2 (c,d) ' 


We want to show in this case that both representations 
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lead to the same value of the image under <f>, that is, 

P \( d , c ) = Pi(d, c) 
q\(d,c ) q 2 (d,c) ' 


The first equation implies that (c, d ) is a solution to the 
polynomial equation p x q 2 - p 2 q\ = 0. But since c and d 
are algebraically independent, the polynomial p x q 2 - p 2 q x 
must be identically 0. Therefore ( d , c) is also a solution, 
from which we deduce the second equation. 

By reversing the argument we also see that <f> is 
one-to-one. It is obvious that <j> is onto and that it leaves F 
fixed. To show that <j> preserves addition, we need to 
verify that 


Pi(t’d) p 2 (c , d) 
q\{c,d) q 2 (c,d) 


Pi(d, c) p 2 (d, c) 

q x (d,c) q 2 (d,c) ' 


To apply <t> on the left side, we first need to cross multiply 
and add the fractions; then we interchange c and d, after 
which we can rewrite again as a sum. It should be obvious 
that this yields the expression on the right, and so a 
formal argument is omitted. Preservation of multiplication 
is verified in the same way. I 


From now on we shall take the field F to be Q. Since Q 
is countable, for any n we can choose n algebraically 
independent elements over Q. The symbols a v a 2 , . . . , a n 
will be used throughout to represent a specific choice of 
such elements, and we shall let E = Q(a,, a 2 , . . . , a n ). For 
each / from 1 to n, let us define 

^ = (~ 1 )' a -( a „ a 2 ,...,a n ), 

where a, is the ith elementary symmetric function of n 
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variables (Section 1.6). Thus the numbers a,, a 2 , . . . , a n 
are the roots of the polynomial f(x) = x n + b t x n ~ 1 + 
• • • + b n , which is a polynomial over the field K = 
F (b it b 2 ,..., b n ). 

Lemma 37c. The polynomial f is irreducible over K. Its 
group over K, namely G(E/K), is S n . 

Proof. The first statement follows immediately from 
the second. By Lemma 37b, each permutation of the a,’s 
gives rise to an element of G(E/F). However, the fixed 
field is actually larger. Since each permutation leaves 
symmetric functions invariant, all the bf s are left fixed, 
and consequently so too are all the elements of K. ■ 

Since E is a finite extension of K, it can be written as a 
simple extension. The next lemma asserts that this can be 
done in a particularly simple way: 

Lemma 37d. There exist integers m v m 2 , . . . , m„ such 
that E = K(m,a, + m 2 a 2 + • • • + m„a n ). The sum m ] a ] + 
m 2 a 2 + • ■ • + m n a n assumes nl different values under the 
n \ possible permutations of the a/s. 

Proof. In the proof of Theorem 18, it was shown how 
multiple extensions may be written as simple extensions. 
The argument there actually shows that for all but a finite 
number of n-tuples (m v m 2 , . . . , m n ), the above simple 
extension equals all of E. 

Since [E : K]= |G(E/K)| = n\, the degree of 
over K is nl. Thus it has n\ distinct conjugates. However, 
the conjugates are the images of 27- i m ,a, under the 
automorphisms in G(E/K), and such automorphisms map 
27-i w , a , t0 similar sums with the a/s permuted. Since the 
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conjugates must be distinct, all these sums must be 
distinct. ■ 

Let us denote by c,, c 2 , . ■ ■ , c n! the n\ distinct values 
mentioned in the previous lemma. We define g(x ) 

Lemma 37e. The polynomial g is irreducible over K. Its 
group over K is G( E/K). 

Proof. Since permutations of a/s only lead to permu- 
tations of the c,’s, the coefficients of g are symmetric 
polynomials evaluated at the af s. Since the af s are the 
roots of a polynomial over K, these coefficients are in K 
(Corollary 9a). The group of the polynomial g over K is 
<5 (E/K) since E is the splitting field of g over K. The 
polynomial g is irreducible since it has a root, m 1 a l + 
m 2 a 2 + ■ • ■ + m n a n , of degree n\ over K, and thus which 
cannot be a root of any nontrivial lower degree factor of g 
in K(x]. ■ 

Thus we note that both / and g, of respective degrees n 
and «!, have the same group over K. We are now going 
to construct two polynomials F(t v t 2 ,..., t„, x) and 
G(t v t 2 , . . . , x) in n + 1 variables over Q. They will have 
the property that their respective degrees in x are also n 
and n ! and that substitution of the values t, - b, for each i 
reduces them to the polynomials / and g. 

In particular, let us begin with n variables s t , s 2 , . . . , s n . 
We define n functions f, by the equation 

/, = ( — 1) o,(i„ s 2 , . . . , s n ) 

for each i from 1 to n. We also define n\ functions w, by 
the sum 27- i m , s , and all the analogous sums obtained by 
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permuting the s,’s. Using these variables, we can construct 
F and G : 

F(t„ t 2 , . t n , x) = x" + t l x"~ l + t 2 x"~ 2 + • • • + t„ 

= n (x - 

<=i 

n! 

G(t\, t 2 , , t n , x) = II (*-«,)• 

i« 1 

Whereas we have a simple explicit representation of F in 
terms of t v t 2 , . . , , t n and x, for G we do not. We know 
that the coefficients of the powers of x in G may be 
written as polynomials in the s’s, and that they are 
symmetric in the s,’s. By the Fundamental Theorem on 
Symmetric Functions, then, we do know that G is a 
function of t x , t 2 , and x. The polynomial F is 

called the generic polynomial of nth degree. 

Lemma 37f. The polynomials F and G are irreducible 
over Q. 

Proof. Suppose we could write a nontrivial factoriza- 
tion 

F(t v t 2 ,..., x) 

= R(t x , t 2 , . . . , t n , x)S(t x ,t 2 ,.. x) 


where R and 5 also have rational coefficients. Then this 
formula would have to hold for all particular values of t x 
through t n , and specifically when for each t, = b t , with b t 
as defined earlier. If R and S each have degree at least 
one in the variable x, then the substitution /, = b, gives a 
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factorization of /, a contradiction to Lemma 37c. If S has 
degree 0 in x, say, then the coefficient of x n in the 
product has the form r(/,, . . . , /„)$(/,, . . . , /„), and this 
must equal 1. Since the degree of rs is the sum of the 
degrees of r and s, and since this sum must be 0, we 
conclude that r and s are nonzero constants. But if s is a 
constant, so is S; and thus the factorization F = RS is a 
trivial one. 

The argument for G is identical. ■ 

We shall actually only need the part of this lemma which 
concerns G. 

It is now rather simple to give the answer to our 
question about polynomials over Q with group S„. 

Theorem 37. For every positive integer n there exists a 
polynomial over Q whose Galois group is S„. 

Proof. By Hilbert’s Irreducibility Theorem, we may 
choose rational numbers / 8 „ P 2 , . . . , P„ such that the 

polynomial G(x) = P 2 /?„, x) is irreducible 

in Q[jc]. We shall see that the polynomial F(x) - 

Pi, ■ ■ • , Pn, *)" •*" + P I*""' + P 2 X n ~ 2 + 

• • • + P n has Galois group S n . 

Let a 1 ,a 2 ,...,a„ denote the roots of F. Then the num- 
ber /Wjcq + m 2 a 2 + • • • + m n a n is in the splitting field 
Q(a 1 ,a 2 ,...,a„). By the definition of G and the u,’s, this 
number is a root of G, which is an irreducible polynomial 
of degree n\ over Q. Therefore [Q(a l; a 2 ,..., a„): Q] > n! 
and so the Galois group of F has order >n\. But since 
this group corresponds to a set of permutations of 
a u a 2 ,...,a„, its order is also at most n\. In order for the 
group to have this order, therefore, it must be precisely S„. 
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Let us state as corollaries the implications of this 
theorem for the problems of solvability by radicals and 
constructibility. 

Corollary 37a. For every n > 5 there exists a 
polynomial over Q of degree n such that none of its roots 
can be expressed in radicals. 

Proof. By Theorem 37 there always exists a poly- 
nomial over Q with group S„. For n > 5, the group S„ is 
not solvable (Lemma 35b). By Galois’ Theorem, the 
polynomial is not solvable by radicals, meaning that at 
least one root cannot be expressed in radicals. But since 
the group is S n , the polynomial is irreducible (Problem 6, 
Section 3.6). Therefore (Problem 5, Section 3.6), none of 
the roots can be expressed in radicals. a 

Recall that in Theorem 20 it was shown that a 
necessary condition for a number to be constructible is 
that its degree over Q be a power of 2. The following 
corollary shows that this condition is not sufficient. 

Corollary 37b. For every m > 2, there exists a 
number which has degree 2 m over Q but which is not 
constructible. 

Proof. Let n = 2 m and let /(x) be a polynomial over Q 
with group S„. Since / is irreducible, each of its roots has 
degree n = 2 m over Q. If all these roots were constructible, 
then every element of the splitting field E of / would be 
constructible, for all such elements can be expressed as 
rational combinations of the roots of /. The degree [E : Q] 
is «!, however, since it equals the order of the Galois 
group. If we write E as a simple extension of Q, E = Q(a), 
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then deg Q o( = «!. But since n > 4, n\ contains the odd 
factor 3, and thus n\ is not a power of 2. By Theorem 20 
then, a is not constructible. Thus at least one of the roots 
of / is not constructible. a 

An example of such a number has already been 
encountered in Problem 5 of Section 3.1. 


PROBLEMS 

1. In the context of Lemma 37b, are there any 
automorphisms of E leaving F fixed other than those 
induced by the n\ permutations of the a- s? 

2. Are the elements by, b 2 , . . . , b n , as defined in the 
text, algebraically independent over Q? 

3. Let H(t y , t 2 , . . . , t n , x) be a polynomial over Q in 

n + 1 variables. Let b v b 2 , . . . , b n be n complex numbers 
which are algebraically independent over Q. Define a 
polynomial h(x) - H(b v b 2 , . . . , b n , x), and consider it 
over K = Q (b v b 2 , , b„). What is the relation between 

the irreducibility of H and the irreducibility of hi 

4. Show that none of the roots of the polynomial 
determined in the proof of Corollary 37b can be 
constructed. 


REFERENCES AND NOTES 

Basic material on power series may be found in advanced calculus 
texts, such as [1, 2], for the real case, and in complex variables texts, such 
as [3, 4], for the complex case. References for our small excursion into 
algebraic geometry are [5, 6j. Hilbert’s original paper on the 
irreducibility theorem is [7]. Our treatment follows [8, 9] except that by 
working at regular points it has been possible to avoid introducing 
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Puiseux expansions with fractional exponents. An example of a 
polynomial over Q with group S 6 is f(x) = x 6 + 22 x 5 - 9x* + 12x 3 - 
37 x 2 - 29 x - 5, for which see p. 109 of [10]. Generalizations of the basic 
problem of this chapter are discussed on p. 262 of [ 1 1 ]. 
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SOLUTIONS TO THE PROBLEMS 

Section 1.1 

1. Use Lemma la to show that if a and b are 
constructive, then so too are a + b, a - b, ab, and, when 
b * 0, a/b. 

By hypothesis, segments of lengths |a| and |fc| may be 
constructed, beginning with a unit segment. To obtain the 
constructibility of the number a + b, we want to construct 
a segment of length |a + b\. If a and b have the same sign, 
\a + b\ = |a| + \b\, in which case the addition part of 
Lemma la gives the result. If a and b have opposite signs, 
then either \a + b\ = |a| - |6| or \a + b\ = |£| - |a|, de- 
pending on which difference is positive. In this case the 
subtraction portion of Lemma la applies. That a - b is 
constructible follows at once from the preceding, since 
a - b = a + (- b) and if b is constructible, then by the 
definition of constructibility so is -b. Multiplication is 
immediate from Lemma la, since \ab\ = \a\ • |fc|; and so 
too is division. 

2. If M and N are positive integers, show that if M fW 
is rational, then, in fact, it must be an integer. Conclude 
then that fl , fj , \2 , f6 , for example, are all irrational. 
(Hint: If M fN is rational it may be expressed in lowest 
terms as Q / R. If R ^ ± 1 it has a prime factor. If a prime 
divides a product of integers, it must divide one of them.) 

Following the hint, if M fN = Q/ R in lowest terms (say 
with R positive), then NR M = Q M . If a prime p divides R, 
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then it divides NR M and hence Q M . But then it must be a 
factor of Q. This contradicts the fact that Q/R is in 
lowest terms. Hence R = 1. 

_By jnspection, there are no integers equal to f2 , V3 , 
V2 , V6 . Hence they are irrational. 

*3. We know from Lemma 1 c that if F is a field, so 
too is the set {a + bfl \ a, b £ F}. What about the set 
{a + b\/2 | a, b £ F}? Is it ever a field ? Is it always a 
field ? Give as complete an analysis as you can. 

It is a field if and only if \J2 £ F. For, if ^2 £ F, it is 
just F itself. If it is a field, then by closure under 
multiplication, (pj2 ) 2 must also be an element of the field. 
Hence there must exist a and b in F for which 
2 2/3 = a + b2 ]/3 . Therefore (2 1/3 ) 2 - b( 2 I /3 ) - a = 0, and 
so 2 1/3 = (b ±Vfc 2 + 4a ) /2. Now , if ^b 2 + 4a £ F, it 
follows that j[2 £ F. If yjb 2 + 4a tf-F, then ]/2 is of the 
form A + Bfk with A, B, and k in F, fk not in F. That 
is, 2 = (A + Bfk ) 3 = C + D}fk for some C and D in F. 
But D = 0, for if not, then V k = (2 - C)/ D which is in F, 
a contradiction. But if we compute (A - Bfk ) 3 we get 

(A - Bfk ) 3 = C- Dfk 

= C 

= 2 , 

so that (A + Bfk ) and (A - B^k ) are both real cube 
roots of 2. By elementary calculus, there is a_ unique real 
cube root of 2, and so A + Bfk — A — Bfk . Therefore 
B = 0, for if it were not, we would have ^k = 0 £ F, a 
contradiction. Thus \2 = A £ F, which completes the 
proof. Hence for F = Q, it is not a field, by the 
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irrationality of . For F = R, it is a field, namely R 
itself. 


4. Let *3 be a collection of fields. Show that 
the common intersection of all the fields in < 3 r , written 
n Feff F, is itself a field. 

Call the intersection S. If a and b are in S, then for 
each F e 3F they are in F. Since F is a field, 
a + b, a - b, ab, and, when b ^ 0, a/b, are also in F. 
Since this holds for every F, they are also in S. Since each 
F also contains 1, so too does S. Hence S is a field. 


*5. Give an explicit description of the smallest field 
containing . (By the “ smallest ” field having a given 
property is meant the intersection of all fields with that 
property. Problem 4 guarantees that such an intersection is a 
field.) 

The answer may be expressed as F = [a + 62 1/3 
+ c2 2/3 | a, b, c G Q}. It should be clear that the desired 
field F must contain all elements of the form 
a + 62 l/3 + c2 2/3 . It remains then to show the set of such 
elements alone is itself a field. Closure under addition and 
subtraction is evident; it is almost as clear that the 
product of two such numbers has the same form, as 2 can 
be factored out of any term 2 4/3 in a product. The 
difficulty is in showing closure under division. This will 
follow from closure under multiplication if we can just 
show whenever a + £2 1/3 + c2 2/3 # 0, which happens to 
be the case exactly when a, b, and c are not all 0, then 
there exist rationals A, B, and C such that 


1 

a + 62 1 / 3 + c2 2 / 3 


= A + B 2'/ 3 + C2 2/3 . 


In fact, by multiplying both sides of the above by a 
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suitable rational (as the reader should check), we can 
assume without loss of generality that a, b, and c are 
integers and are relatively prime (that is, they have no 
common factor other than ± 1). (Of course, we cannot 
expect that A, B, and C will also be of this form.) 
Multiplying the equation through by a + h2 1/3 + c2 2/i 
and grouping terms yields 

1 = [aA + 2cB + 2bC] +[bA + aB + 2cC]2'/ 3 

+ [cA + bB + aC] 2 2 / 3 , 

which will have a rational solution A, B, C if the three 
linear equations 

aA + 2 cB + 2 bC = 1 
bA + aB + IcC = 0 
cA + bB + aC = 0 

have a rational solution in the unknowns A, B, and C. 
This will certainly be the case if the determinant of the 
coefficients 


A = 


a 

b 

c 


2c 2b 
a 2c 
b a 


is not 0, for then the solutions can be calculated from a, b, 
and c by rational operations, from which it follows that 
they are rational. We will show that the assumption A = 0 
leads to a contradiction of the fact that a, b, and c are 
relatively prime. Expanding A by cofactors of the first 
column, or by any other method, we obtain 


A = a(a 2 - 2 be) - b(2ac - 2b 2 ) + c(4c 2 - 2 ab) 
= a 3 — 6 abc + 2 b 3 + 4c 3 . 
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Now, if A = 0, then 2 divides a 3 , as it clearly divides the 
other terms. Hence 2 divides a. But then 4 divides the 
first, second, and fourth terms, and so 2 divides b 3 and 
hence b. But then 8 divides each of the first three terms, 
and so 2 divides c 3 and hence c. Thus 2 is a common 
factor of a, b, and c. As noted earlier, this contradiction 
completes the proof. 

*6. If p is a prime and N is a positive integer > 2, give 
an explicit description of the smallest field containing 
N \fp . ( This is a generalization of the previous problem.) 

The answer is F = (27- o' a jP^ N I eac ^ a j G Q}- As in 
the previous problem, the desired field clearly must 
contain this set. Thus it remains to show that this set is 
itself a field. Closure under addition, subtraction, and 
multiplication is readily established; but the difficulty lies 
in establishing closure under division. This reduces to 
closure under multiplication if we can show that every 
nonzero element has a reciprocal of the same form. That 
is, we want to show that whenever £ !j-o a jP J/N ^ 0 
(which happens to be the case exactly when not all the af s 
are 0), then there exist rationals Aj, 0 < j < N — 1, such 
that 

1 

a 0 + a lP '/ N + a 2 p 2 ' N + ■ ■ ■ + a N _ x p^~^ N 

= A Q + A x p'/" + A 2 p 2 ' N + ■ ■ ■ +A N _y N ~'V N . 

By multiplying both sides by a suitable rational (in fact 
the reciprocal of the least common denominator of all the 
af s, when expressed as the quotients of integers) we can 
reduce the problem to the case in which the af s are all 
integers. Then, by multiplying both sides by the greatest 
common divisor of the af s, the problem is reduced to the 



226 


SOLUTIONS TO THE PROBLEMS 


case where the aj s are relatively prime (that is, they have 
no common factor other than ±1). Thus, without loss of 
generality, we shall establish the existence of appropriate 
AjS for the case where the aj s are relatively prime 
integers. Multiplying through by the denominator above, 
and grouping appropriately, we obtain the equation 

1 = [ OqAq 4- pa N _ x A ! + pa N _ 2 A 2 + • • • + pa x A N _ , ] 

"b [ a l A 0 + 1 + P a N- \ A 2 ' ' ' + P a 2 Ay _ i 

+ [a 2 A 0 + a l A l + M 2 + ' * ' + /’Mv-i]/> 2/ " 

+ 


+ [oy-iAo + ay _ 2 A | + a N _ 2 A 2 


This will be solved by any solution of the following 


simultaneous linear 

equations : 

in the 

unknowns 

A 0 , A 

. . . , Ay_ 

i : 










+ 

pay_,A l 

+ 

P a N -2 A 2 

+ ■ • • 

+ 

pa 1 

a n- 

,= 1 

a, A 0 

+ 

a o A 1 

+ 

P°n-\A 2 

+ ■ • • 

+ 

P* 2 

Ay_ 

. = 0 

a 2 A 0 

+ 

Mi 

+ 

a 0 A 2 

+ • • • 

+ 

pa 3 

A y — 

, = 0 

O 

1 

* 

*3 

+ 

a N-2 A 1 

+ 

a N - 3 A 2 

+ • • ■ 

+ 

«o 

Ay_ 

,=0. 


A solution certainly exists if the determinant of the 
coefficients is not 0, in which case the values of the Ajs 
will all be rational, as follows from the fact that their 
explicit calculation only involves the application of 
rational operations to the coefficients. Thus our solution 



SOLUTIONS TO THE PROBLEMS 


227 


will be complete once it is shown that the determinant 

pa N - 1 pa N - 2 ••• P a i 

a 0 P a N-\ pa 2 

a i a 0 • • • P a 3 

a N- 2 a N- 3 • • • «0 

cannot be 0. This will be done by showing that the 
condition A = 0 leads to a contradiction of the assumption 
that the a- s are relatively prime. 

The basic tool is the usual definition of a determinant 
as a sum of products, each prefixed by an appropriate sign 
(plus or minus) and containing as factors exactly one 
element from each row and each column of the array. 
Observe that each term of the expansion of A contains 
explicitly the factor p, except for the term a". Thus (under 
the assumption A = 0), p divides afi , and hence p divides 
a 0 . Thus, in A, we may replace a 0 by pa’ 0 for some inte- 
ger a' 0 : 



a N- 1 



pa o 

P a N- 1 

P a N -2 

. pa t 


a \ 

P a 0 

P a N-2 

■ P a 2 

A = 

a 2 

«l 

pa 0 

■ pay 


a N- 1 

a N- 2 

a N- 3 

. . pa' 0 


But now every term of the expansion contains explicitly 
the factor p 2 , except for the term pa , w , gotten by taking the 
last element of the first row and then all the elements just 
below the main diagonal. Thus p divides a y, and so p 
divides a x . Continuing in this fashion, it is easy to see that 
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p divides each a j: which is a contradiction to their being 
relatively prime. 

7. Is {a + b^2 + cf 3" | a, b, c £ Q} a field ? 

No, because it would have to contain V6 (by closure 
under multiplication), and this is impossible. To see this, 
suppose that for some rational_numbers a, b, and c, we 
could write V§" = a + bfl + ci/3~ . It would follow that 

VfT - a = bfl + cy/3 , 

(V6 - a) 2 = (bfl + cyT ), 

6 — 2af6 + a 2 = 2b 2 + 2bc^6 + 3c 2 , 

6 + a 2 - 2 b 2 - 3 c 2 = 2(a + bc)f 6 . 

This contradicts the irrationality of except possibly 
when a + be = 0. But in this case, we also have 

6 + a 2 -2b 2 - 3c 2 = 0, 

6 + ( - be) 2 — 2b 2 — 3c 2 = 0, 

b\c 2 - 2) = 3(c 2 - 2), 

and so either c 2 = 2 or b 2 = 3, which are both impossible 
by the irrationality of V2 and i/3" . 

8. Give an explicit representation of the smallest 
field containing both }J2 and . ( Hint : Use one of the 
lemmas .) 

The required field F is 

{ a + bf2 + c\/ 3" + (bfb \ a, b, c, d GQ}, 
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To see this, we develop F by a sequence of extensions. Let 
F, = Q(V2~ ) and F = F,(V3" ). By the closure property, F, 
which by Lemma lc is a field, must be the answer. For, 
any field containing ^2 and fi must contain all the 
elements of F. As to the representation of the elements of 
F, it consists of the set of elements of the form 
(a + b^jl ) + (c + dyjl , where a, b , c, and d E Q, and 
thus is given precisely by the original description. 

9. For the example treated after Lemma 1 d, give a 
different development of the same number from the rationals 
which results in a different sequence of field extensions. 

The development: 

13,VU,6,V6 ,7,V7 ,V^T -VI3 , !, 

1, 2, 2/f , 1 + 2 V7 , \/l +2/T , V6 +\/l +2VT , 

i +}/r+w , i yv6 +vtt2^~, 
vTi + f yVs - +ViW 

gives the sequence of extensions: 

F 0 = Q, F, = F 0 (VI3), F 2 = F,(V6), 

f 3 = f 2 (VT), f 4 = f 3 (WtT ), 

F 5 = F 4 (Vl+2^ ), F 6 = F 5 (^fjf+y^+W~ )• 

10. Let F be a field. Is the set {a + bfk | a, b, k 
G F, k > 0} necessarily a field ? ( This differs from F(fk ) 
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because in the present case k is not fixed, but rather it 
ranges through all positive values in F. 

It is not a field, since it is not necessarily_even closed 
under addition. For suppose F = Q. Then f2 +f3 does 
not belong to this set, even though both and V3 
individually do. To see this, suppose we could write 
V2 +V3 = a + bfk for some rationals a, b, and k, k > 0. 
Without loss of generality, we may assu me k is an 
integer, as we can always replace yM/N by 
(1/ N)fM ■ N . Squaring once, we obtain 

2 + 2V6 + 3 = a 2 + 2 abfk + b 2 k. 

If either a or b is 0, this contradicts the irrationality of fb . 
Otherwise we have 

2f6 - 2 abfk = a 2 + b 2 k - 5 

(2y/6 - 2 abf/c ) = (a 2 + b 2 k - 5) 2 

24 - 8 abfbk + 4a 2 b 2 k = ( a 2 + b 2 k - 5) 2 

from which it follows that V 6&~ is rational and hence, by 
Problem 2, an integer. By considering the prime power 
decomposition of k, we see that 2 and 3, and hence 6, 
divide k; from which it follows that k = 6 M 2 for some 
integer M. Thus we have 

5 + 2V<r = a 2 + 2abMfb + b 2 (6M 2 ), 
or 

5 - 6M 2 6 2 - a 2 = (2 abM - 2)f6 . 

So, by the irrationality of , the right and hence the left 
side are both 0. From 2abM — 2 = 0, we get M = 1 / ab. 
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Using this, 

5 - 6M 2 b 2 - a 2 = 0 
5 — 6/ a 2 — a 2 = 0 

(a 2 f- 5a 2 + 6 = 0 

2 _ 5±V25^24 
2 

a 2 = 2 or 3, 

thus contradicting the irrationality of ^2 or \f} . 

11. The following argument is occasionally found in 
proofs of Theorem 1: “ Using only points in the plane of a 
field F, by a single fundamental construction it is only 
possible to construct points in the plane of some V(fk ), 
k E F.” Criticize this argument or use it to give a simpler 
proof of Theorem 1 . 

It is ambiguous at best, fallacious at worst. The 
difficulty is that on a single fundamental construction, it 
is possible to obtain points in the planes of several 
different quadratic extensions of F, for the new circle or 
line may intersect any number of previous ones, each case 
possibly leading to a different extension. 

12. What else would you like to know ? Spend a little 
time thinking about the material in this section , and, perhaps 
especially, in the above problems. Can you anticipate the 
solution of any of the classical construction problems : 
doubling the cube, trisecting an angle, squaring the circle ? 
Are you curious about anything ? Do any questions occur to 
you that seem to be interesting ? Make a short list of your 
questions and ideas, spending at least enough time on each 
to satisfy yourself that it is nontrivial. Keep these ideas in 
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mind, jotting down others as they occur to you as you 
proceed through the material in subsequent sections. 

There are many possibilities. Three of the most striking 
may be the following. What is the answer to Problem 6 
when p is not prime? For example, what about N = 3 and 
p = 4 or 6? What is the smallest field containing vr? In 
connection with Problem 9, do all such sequences of 
extensions have the same number of fields? The 
observation that one can always toss in irrelevant steps 
requires that this question needs rephrasing to rule out 
such cases. Assuming this is done, now what is the 
answer? Incidentally, Problem 3 essentially solves the 
problem of doubling the cube. 

Section 1.2 

1. Can the cube be “ tripled ”? 

No. This would require that V3 be constructive. But, 
by the identical argument as that used to develop Lemma 
2 and Theorem 2, this would imply that V3" is rational, 
which is a contradiction. 

2. Can the cube be “ quadrupled ”? 

No. This would require that \4 = ( jl ) 2 be construct- 
ive. By Lemma lb, then, ]/2 would be constructive, 
which was shown to be impossible in Theorem 2. 

Section 13 

1. It is obvious that by repeated bisection it is possible 
to divide an arbitrary angle into 4 equal parts. Show how 
this may also be deduced algebraically from the equation 
relating cos 46 and cos 6. 

We find cos 49 = 8 cos 4 # — 8 cos 2 # + 1 and so cos 2 # 
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= [8±V64-32(1 — cos 40)]/ 16. Since the operations 
involved here all correspond to possible constructions, 
cos 2 # can be constructed if we are given cos 4#. But then, 
by Lemma lb, cos 9 can also be constructed. 

2. Discuss the validity of the following proof of 
Theorem 3: “ By the continuity of cos 6, there exists an 
angle 8 0 such that cos # 0 = ( V 2 )/2. By Theorem 2, cos 9 0 
is not constructible . Hence 3 9 0 is an angle that cannot be 
trisected." 

It is invalid. Even though cos 9 0 cannot be constructed 
from a unit segment, it is conceivable that it might be 
constructed with segments equal to both 1 and cos 3# 0 
being given. 

*3. Prove that it is not possible, in general, to quintisect 
an arbitrary angle, i.e., to divide it into five equal parts. 

The angle between 0 and it whose cosine is -5/6 is 
constructible, since - 5/6 is a constructible number. Call 
this angle 50. If 5# were also quintisectible, then 0 would 
be constructible, or equivalently cos 8 would be a con- 
structible number. Setting x = 2 cos 9, the trigo- 
nometric addition formulas yield 2 cos 59 = x s - 5x 3 
+ 5x, so that in this case at least one of the roots of f(x) 
— 3.x 5 - 15x 3 + 15jc + 5 = 0 would be constructible. This 
polynomial has one root between 1.7 and 1.8 by the Inter- 
mediate Value Theorem, since /(1.7)<0 and/(1.8)>0 
by computation. Call this root 2 cos #,. Thus cos 9 { >.85 
and from tables 0 < #, < 32°. Letting 

0 — _ a a — ^77 , p 

"2“ 5 "i> “s ~ 5 + "i> 

a 477 _ p a — _i_ a 

“4 — 5 “5 ~ 5 + “i> 
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we make two observations. First, for all /', cos 50, = — 
5/6, and so all the numbers cos 0, are roots of /. Second, 
the numbers 0, are all on the interval from 0 to ir and they 
are distinct (follows since 0, < tt/ 5 = 36°), and therefore 
the numbers 2 cos 0, are the complete set of five roots of /. 
Third, if any 0, is constructible, they all are; this is 
because they differ by a multiple of tt/5, which is a 
constructible angle (w/5 = 36°, one-half the central angle 
of a regular pentagon, which is well known to be 
constructible). Putting it all together, if 50 is quintisect- 
ible, then all the roots of / are constructible numbers. 

The rest of the solution makes use of two very simple 
facts called to the reader’s attention in subsequent 
sections. The first is that the sum of the roots of a poly- 
nomial a n x n + a n _ l x n ~ l + ■ • ■ + a 0 is just -a n _^/a n . 
(See Section 1.6.) Thus the sum of the roots of / must be 0. 
The second is that if M/N is a rational root in lowest 
terms, then N must divide a n and M must divide a Q . (See 
Problem 2, Section 1.5.) From this latter result we see that 
/ has no rational roots, for the candidates are ±1, ±1/3, 
±5, ±5/3, and none of these gives 0 when substituted 
into /. 

Let N l be minimal such that there is a root of / in a 
field ¥ n> obtainable from Q by a sequence of quadratic 
extensions Q = F 0 c F, c F 2 c ■ • • c F^. Call such a 
root r, ; it has the form a + b^k , where a, b, k G F w ,, 
Vk ^Fjy j, and b ± 0. If we compute f(a + b{k ) we see 
that it has the form A + B\/k , for some A, B G F w 
Since A + By[k = 0 and ^F^,, it follows that B = 0. 
Because / contains only odd powers of x, we find that 
/(a - bjk ) = A - B^k = A = A + B^k = 0 also, so that 
a — b\jk is a second root of /. Let us call it r 2 . 

Suppose a third root r 3 of/ also belongs to F^. By the 
same argument as above so too would a fourth root r 4 . 
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But then r 5 = - r, — r 2 — r 3 - r 4 would be in F W| _ ,, since 
the fk ’s cancel on addition. This would contradict the 
minimality of N v 

Thus there must be a minimal N 2 > N l such that we 
can continue our sequence of field extensions F 0 c F, 
C ' ' ' CF* CF W +1 C • ■ • CF^ until another root 
r 3 G F N . As above F Ni must contain also another root r 4 . 
But then the fifth root, r 5 = - r, - r 2 — r 3 - r 4 would be 
in F N contradicting the minimality of N 2 . 

Therefore none of the roots of / can be constructible, 
and this completes the proof. 

4. Is the difficulty in Problem 2 real or imagined ? 
Show that there are trisectible angles 3 9 such that 9 is not 
constructible. 

We know cos 30 = 4 cos 3 0 - 3 cos 0. If we pick a 6 0 
satisfying cos 8 0 = )/2, then cos 30 o = 1 - (3/2) . 

By Theorem 2, 9 0 is not constructible; but it can obviously 
be constructed given segments equal to both 1 and 
cos 30 o . 


Section 1.4 

1. The number fl +\ f 3 , being constructible, has 
associated with it a sequence of quadratic field extensions 
Q = F 0 c F, c • • • CF„ such that fl +\/3 G F„. De- 
termine the smallest such number N. Using this value, what 
is the degree of the polynomial whose existence is proved in 
Theorem 4? Following the procedure in the proof of Theorem 
4, determine this and any intermediate polynomials. 

By the solution to Problem 10, Section 1.1, N > 2. That 
N = 2 will do is clear from taking F, = Q(V2 ), F 2 = 
F,(V3 ). The proof of Theorem 4 gives a polynomial of 
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degree 2 2 = 4. It is determined as follows: 

x-yfi-fi = 0 
x -V2 =^3 
x 2 - (2& )x + 2 = 3 

which is the polynomial of degree 2 with coefficients in 
F,. Repeating the process, 

x 2 - 1 = (2V2 )x 

x 4 - 2x 2 + 1 = 8x 2 
x 4 - 10x 2 +1=0. 


2. Does there exist a polynomial equation with rational 
coefficients and of degree less than that determined in 

Problem 1 such that V? is a root ? 

No. Suppose there were a lower degree polynomial 
equation with rational coefficients, such that fl is a 
root. Then for some rationals a, b, c, and d (not all 0) we 
would have: 

0 = a{{ 2 +f3)+ b(f2 +f3 )+ c(fl +V3 ) + d 
= a[2f2 + 6V3 + 9\^ + 3>/3 ] + 6[2 + 2V6 + 3] 

+ c[V2 +V3" ] + d 

= f2 [11a + c] +V3 [9a + c] +V6 [2b] + [5b + d]. 

But by the solution to Problem 7, Section 1.1, b = 0. 
Similar calculations imply that a = c = d = 0. Thus there 
is no such equation. 
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Section 1.5 

1. a. If f is a polynomial of degree n > 1 over a field 
F, and if r is any element of F, show that there is a 
polynomial q of degree n - 1 over F and a number R in F 
such that f(x) = (x — r)q(x) + R. 

b. Show that, in the above, R = f(r). 

c. Let f be a polynomial over C. Show that r is a 
root of f if and only if x - r is a factor of f. 

a. If f(x) = a n x" 4- a n _ } x n ~ l + • • • + a 0 , we seek to 
find the coefficients of a polynomial q(x) = A n _ l x"~ l + 
• ■ • + A 0 such that /(x) = (x - r)q(x) + R, for some 
number REF. Thus, by equating coefficients of like 
powers of x, we obtain the equations: a n = A„_ u a„_ x = 
— r A„~ i + A n _ 2 , . . . , a n -j — ~ r A„~j + A n _j_\, . . . , a 0 
= - rA 0 + R. These may be solved in order, yielding 
appropriate Afs and R. 

b. In /(x) == (x - r)q(x) + R, let x take the value r. 
This gives R = f(r). 

c. The “if” part is obvious and the “only if” part is a 
consequence of the above. 

2. If f is a polynomial with integral coefficients it is 
sometimes useful to know whether it has any rational roots. 
Suppose M / N is a rational root expressed in lowest terms. 
There is a simple necessary condition connecting M and N 
with the first and last coefficients of the polynomial. Find it. 
( This result is called the Rational Roots Theorem. It gives a 
necessary condition that can be used to narrow down the 
candidates for a rational root to a small collection which can 
each be tested.) 

Writing /(x) = a n x" + a n _ x x n ~ x + • • • + a 0 , if M/N 
is a rational root in lowest terms, then N must divide a„ 
and M must divide a 0 . To see this, substitute M/N into 
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the equation /(a) = 0, multiply through by N n , and collect 
terms appropriately, thereby obtaining 

a„M n = N[ — a„_ 2 M n ~ 2 N - ■ ■ ■ - a 0 N n ~'], 

and 

a 0 N n = M[-a n M n ~ x - a n _ x M"- 2 N - • • • - a,#" -1 ]. 

From the first, we see that N divides a n M n , but since N 
has no factors at all in common with M, it must divide a„. 
(Just take the prime power decomposition of N and 
observe that the factors, one by one, each divide a n .) 
From the second, by similar reasoning, we see that M 
must divide a 0 . 

3. Use the results of Problem 2 to find any rational roots 
of each of the following: 

a. 8x 3 — 6jc — 1 

b. x 3 — 3x - 1 

c. 2x 3 + 3x 2 - x - 1 

d. 3x 5 - 5 jc 3 + 5x - 1 

e. a 3 + 4a 2 + a — 6. 

a. Candidates: ±1/8, ±1/4, ±1/2, ±1. None of 
these turns out to be a root. 

b. Candidates: ± 1. Neither is a root. Notice how, as 
in Section 1.3, this and the previous equation are closely 
related, the roots of the second being twice the roots of 
the first. But the second is much easier to check for 
rational roots. 

c. Candidates: ±1/2, ±1. - 1/2 is the only rational 
root. 

d. Candidates: ±1/3, ± 1. None is a root. 
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e. Candidates: ±6, ±3, ±2, ±1. The roots are 
-3, -2, + 1. 

4. If F is a countable field, show that the field of all 
numbers algebraic over F is also countable. 

There are many ways to do this; the reader may find 
the following method instructive. Let us first take the case 
where F = Q. Each algebraic number is the root of a 
polynomial with integral coefficients. So first we show 
that the set of such polynomials is countable. Denoting by 
P\>Pi>Ps> ■ ■ ■ the sequence of primes (of which there are 
an infinite number), we associate with each nonzero 
polynomial a n x" + a n _,x n ~‘ + • ■ ■ + a 0 the rational 
number P\°pj'pj' • • ■ p„+\, which is in lowest terms. This 
is a one-to-one correspondence with the positive rationals 
excluding 1 . Since such rationals form a countable set, the 
polynomials with integral coefficients can be written in a 
list: /i,/ 2 ,/ 3 , ... • To make a list of the algebraic 
numbers, we list first the roots of /,, then of / 2 , etc., 
ignoring repetitions. If F is any countable field, the 
polynomials over F can be put in an obvious one-to-one 
correspondence with those with integral coefficients. Thus 
we can also write a list /,, / 2 , . . . , of such polynomials, 
from which we see that the set of all their roots is 
countable. 

5. Show that the result of Problem 2 of Section 1.1 is 
just a special case of Problem 2 of this section. 

jyj 

yN must be a root of x M — N = 0. By the result of 
Problem 2 of this section, any rational root of this 
equation, when expressed in lowest terms, must have a 
denominator dividing 1, the leading coefficient. Hence the 
root must actually be an integer. 
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6. If r is a root of multiplicity >2 of a polynomial f, 
show that f'(r) = 0, where f denotes the derivative of f. 

By the definition of multiplicity, we may write 
/(a) = (jc — r) 2 g(x). By the product rule for differentia- 
tion, /'(a ) = (jc — r) 2 g'(x) + 2(x - r)g(x), from which it is 
clear that f(r) = 0. 

Section 1.6 

1. The cubic 2x 3 — 3a 2 — 32 a — 15 has two roots whose 
sum is 2. Find all the roots. 

The sum of all three roots is - a n _ i /a n = 3/2, so one 
root is - 1/2. Thus (a + j) is a factor, and by division we 
obtain 2a: 3 - 3a 2 - 32a - 15 = (a + ^ )(2 jc 2 - 4a — 30). 
Thus the other roots are the roots of the quadratic factor, 
which are found to be 5 and -3. 

2. Find the product of all the roots of 2a 5 - 3a 4 + 
7a 3 -7a 2 + 3a - 1. Are any of the roots rational ? 

The product of the roots is (- \) n a 0 / a n = 1/2. By 
Problem 2, Section 1.5, the only candidates for rational 
roots are ± 1 and ±1/2. Calling the polynomial /, we find 
/(!)- !,/(-!)- -23,/(l/2)= -1/2, and /(-1/2) 
= -43/8. So there are no rational roots. 

3. Find the value, expressed in terms of the coefficients, 
of the sum of the squares of the roots of a n x n + 
a n-iX nl + ■ ■ ■ + a 0 . 

Since the symmetric polynomial a 2 + x\ + • • • + a 2 
may be expressed in terms of the elementary symmetric 
functions as of - 2 o 2 , we get for the sum of the squares of 
the_ roots (-a n _Ja n ) 2 - 2a„_ 2 /a n , or K 2 _ , - 2a n _ 2 a„] 
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4. Find the value, expressed in terms of the coefficients, 
of the sum of the reciprocals of the roots of a n x n + 
a„_ x x n ~ x + • • • + a 0 , where a 0 ± 0. 

The reciprocals of the roots of the given polynomial are 
the roots of the polynomial a 0 x n + a,*" -1 + • • • + a n , 
the sum of whose roots is - a,/a 0 . 

5. Let P x and P 2 be two polynomials in n variables. 
Show that the highest term of the product P X P 2 is the 
product of the highest terms of P x and P 2 . ( The word 
“ highest ” refers to the ordering on the exponent n- tuples.) 

This is an immediate consequence of the fact that 
inequalities involving exponent n-tuples can be “added” 
in the usual way. That is, suppose (/„ i 2 , . . . , i„) 
> O'!, i' 2 , ..., O and (J x ,j 2 , . . . ,j„) > (j\,j 2 , . . . 
then, from the definition of the ordering and the 
corresponding property for inequalities involving real 
numbers, (/, +j „ i 2 +j 2 , +y„) > (/', +/,, i\ + 

j'v •••»'« +j'n)- 

6. ( Alternate proof of Theorem 8.) Suppose a and b are 
algebraic over a field F. Then a is a root of some polynomial 
over F, whose complete set of roots may be written 
a = a x , a 2 , . . . , a n , where each root is listed as many times 
as its multiplicity. Similarly for b, giving rise to roots 
b = b x , b 2 , . . . , b m . Apply the theory of symmetric functions 
twice successively to the expression 

n m 

n n (•* - a, - bj) 

'=1 7=1 


to conclude that a + b is algebraic over F. In a similar way, 
show that ab is also algebraic over F. 
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Applying Corollary 9a to each coefficient in the poly- 
nomial 117 - i( M — bj)> we see that it is a polynomial over 
F, call it/. Then n?_ ,117- i(* ~ a , ~ b j) = II"- 1 f( x ~ <*/)• 
Applying Corollary 9a to each coefficient in the 
expansion of this as a polynomial in x, we see that it is a 
polynomial over F. But a + b is one of its roots. The 
product case is similar. Write 


n m 

UIHx 


/-i j - i 



■ = Ilarf(x/a,), 


where / is a polynomial over F. Since the degree of / is m, 
each coefficient of affix /a ,) is an element of F multiplied 
by some nonnegative integral power of a,. Taking the 
product over all i, there results a polynomial in x to each 
of whose coefficients Corollary 9a is applicable. 


7. Prove that every permutation of the variables 
x x , x 2 , . . . , x n can be accomplished by a succession of 
transpositions, where a transposition is the single interchange 
of two of the variables. ( This was used in the proof of 
Corollary 9b.) 

By induction on the number of variables. If n = 1, there 
is nothing to prove. Assuming it is true for n - 1 variables, 
proceed as follows. We want to transform the ordered 
m - tuple (x„ x 2 , ■ ■ • , x„) into (x t(1) , x H2y • • ■ , **<„)), 
where <f> is a permutation on the first n integers, by a 
sequence of transpositions. For j = <t>(n), interchange x } 
and x n . It remains to properly permute the remaining 
n - 1 variables, which can be done by a succession of 
transpositions, according to the inductive hypothesis. 


8. With reference to the preceding problem, there may 
be many different sequences of transpositions resulting in the 
same permutation. Show that for a given permutation, all 
sequences of transpositions leading to it have the same 
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parity, that is, either they all contain an even number of 
transpositions or else they all contain an odd number. (In 
this way, permutations can be classified as “ even ” or 
“odd".) 

Consider the behavior of the alternating function P of n 
variables, P(x,, x 2 , . . . , x„) = IXx/C*/ — x y), under two 
sequences S and S' of transpositions leading to the same 
permutation. Writing P out, 

**-(*!- *2)(*1 - * 3 )(*l “ -*4) ' ' ' (*l - X n) 

■ ( x 2 ~ x i)( x 2 ~ X 4 ) ■ ■ ■ (x 2 - X„) 


■ (Xn-\ ~ x n ). 

we see that any transposition merely changes the sign of 
P. For, suppose x t and Xj are interchanged, with /' < j. 
Then the factors involving neither x, nor Xj are left 
unaltered; the factor (x, - xf) is replaced by (x ; - x,) = - 
(x, — Xj)\ and the remaining factors may be paired off to 
form products of the form ±(x, - x k )(xj - x k ), k i or 
j, each of which is unaltered by the transposition. Now 
suppose that S contains N transpositions and S' contains 
N' transpositions. Under the resulting permutation, P is 
transformed to (- iyV and (- 1 ) N P. Since these must be 
the same, N and N' must both be odd or both be even. 

Section 1.7 

1. Prove that e z, e z 2 = e z,+Z2 . 

Letting z, = x, + iy x and z 2 = x 2 + iy 2 , we want to 
show that 

^'(cosj, + i sin y } )e Xl (cos y 2 + i sin/ 2 ) 


= e x ' +x *(cos (y l +y 2 ) + i sin(j, + y 2 )). 
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Since x, and x 2 are real, e x 'e* 2 = e x,+X2 , so it suffices to 
show that (cos/, + i siny,)(cosy 2 + i sin y 2 ) = cos(j, + 
y 2 ) + i sin(y, + y 2 ). Multiplying out on the left and using 
the fact that i 2 = — 1, this simply reduces to the usual sine 
and cosine addition formulas. 


2. Prove that the binomial coefficients are in fact 
integers. 

For N > 1 and 0 < K < N, we want to show that the 
binomial coefficient (*), given by (*) = N\/K\(N - AT)!, 
is an integer. By a simple calculation, for 1 < K < N — 1 , 


For K - 0 or N, (£) = 1. The result now follows at once 
by induction on N. 


3. The modulus of a complex nu mber z = x 4- iy, where 
x and y are real, is given by |z| =yx 2 + y 2 . If F(z) is a 
complex function of a complex variable, then the definition 
of the statement lim z _ >Z( F(z) = L is that for every e > 0 
there exists a S> 0 such that \F{z) — L| < e whenever 
0 < |z — z 0 | < S. This is formally identical with the real 
case, the difference being the use of moduli instead of 
absolute values. If G(z) is a complex function of a complex 
variable, the derivative of G is the new function whose value 
at any point z 0 is given by 


G\z o) = Bm 

Z—*Zq 


G(z) - G(z 0 ) 


(/(j in the real case, some complex functions are 
differentiable and some are not.) Show that the usual rules 
from elementary calculus carry over in the complex case for 
each of the following: 
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a. the derivatives of sums, differences, products, and 
quotients', 

b. the derivatives of polynomials', 

c. the derivatives of e z and e~ z . 

a. The calculations are identical with those from 
elementary calculus, so we simply work out the details for 
one part, say the product rule. Given two differentiable 
functions u and v, we want to show that (uv)'(z 0 ) = 
u(z 0 )v'(z 0 ) + v(z 0 )u'(z o). We have: 


(«°)'(*o) = fog 

Z—*Zq 


u(z)v(z) - u(z 0 )v(z 0 ) 
z - Z 0 


- lim 

z—*z 0 


u(z)v(z) - u(z)v(z 0 ) + u(z)v(z o) - u(z 0 )v(z 0 ) 
z - z 0 



We have of course used the basic properties of limits as 
well as the continuity of u at z 0 , all of which are very 
straightforward. 

b. The results of the previous part, together with the 
observation that the derivative of a constant is 0, reduce 
the problem to showing that for n > 1 , the derivative of 
z" is nz"~\ For « = 1 it is obvious. The induction 
step follows from the product rule: (z")' = (z-z n_l )' = 
z- (n - l)z n_2 + 1 •z" -1 = nz" _1 . 
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c. From the definition of the derivative and from 
Problem 1, 

(< e ! )'(z 0 ) = lim g ~ e ° = e z ° lim e ~ 1 , 

^~* z o Z - z 0 h—*Q h 

where h = z — z 0 is complex. It remains to show that this 
limit equals 1. Writing h = a + ib, with a, b £ R, this limit 
becomes 

e a (cos b + i sin b) — 1 (a - ib) 

lim - — 

(a, *)— *(o, o) a + ib (a - ib) 


lim 

(a, b)M 0 , 0 ) 


ae a cos b — a + />e a sin b 
a 2 + b 2 


+ i lim 

(a, *)->( 0 , 0 ) 


b - be a cos b + ae a sin b 
a 2 + b 2 


We want to show that the first limit equals 1 and the 
second equals 0. These results follow after appropriate 
algebraic manipulation, coupled with the observation that 
|2a6| < a 2 + b 2 and the following elementary real limits: 


,• 1 - cos b „ sin b , ... e a — 1 , 

lim = 0, lim — = 1, and lim = 1. 


The derivative of e z can now be obtained by the 
quotient rule, since e~ z = \ / e z . 

4. Let <p(u) be a complex-valued function of the real 
variable u on the interval [a, b] and suppose its real and 
imaginary parts are given by differentiable functions r(u) 
and s(u), respectively, so that 

<p(u) = r(u) + is(u). 
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Then the derivative 4>'(u) is defined by the equation 
= r’(u) + is'(u). 

Similarly, the integral f b <j>'(u)du is defined by the equation 
f <f>'(u)du= f r'(u)du+ i f s’(u)du. 

J a a J a 

Prove the relevant form of the Fundamental Theorem of 
Calculus: 


<K b ) ~ <K a ) = f <t>'(u)du. 

J a 


From elementary calculus, f b r'(u)du = r(b) - r(a), and 
similarly for .v. Thus f b <t>'(u)du = r(b) - r(a) + is(b) - 
is(a) • <Hb) - 4fa). 

5. Let x be a fixed complex number, g a differentiable 
complex valued function of a complex variable, and u a real 
variable. Then the function = g(ux ) is a complex 
function of a real variable. Show that </>'(«) = g (ux) ■ x. 

The definition of </>'(«) from Problem 4 clearly implies 
that for each u 0 , 


<H U )~<K U o) 

(u 0 ) = lim 

u^w 0 u — U n 


from which we deduce 


<t>'(u 0 )= lim 

v 7 U-+U 0 


g(ux) - g( Uo x) 

X 

ux — u 0 x 


= g'( u o*)-x- 
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6. In which part(s) of the proof of Theorem 10 was it 
necessary to use the fact that p is prime ? 

Only in the proof of Claim 3. 

7. It was implicitly assumed in the proof of Theorem 10 
that there are an infinite number of primes. Where was this 
assumed ? Prove this fact. 

It was assumed in the final step, yielding the 
contradiction. That is, it was assumed that there actually 
exists a prime larger than k, a, and a 0 and such that 
CB P ~ { /(p - 1)! can be made arbitrarily small. 

Suppose that there were only a finite collection of 
primes, p x , p 2 , . . . , p N . Then the number l+JJfL,/?, 
would have to be prime, not being divisible by any of 
p v p 2 , . . . ,p N . But it is too large to equal any of the 
primes in the list, a contradiction. 

8. Prove that the number e is transcendental. ( Hint : 
This is quite a bit easier than the corresponding proof for it, 
but can be developed along the same outline. In particular, 
the assumption that e is algebraic leads immediately to an 
equation very similar to the equation e q ' + e qi + ■ • • + 
e q ” + k - 0, but where the exponents are integers and there 
may be integral coefficients in front of the terms. Working 
backwards, the analogue of g can be constructed, and from 
this f and F. The proof proceeds almost as before, but 
without the complication of symmetric functions, except in 
summing, some coefficients need to be inserted. This 
problem should help the reader gain a better understanding 
of the proof for tt. 

If e were algebraic, it would satisfy an equation of the 
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form 


b m em + b m- \ e>n 1 + • • • + V 1 + k = 0, 

where k and the bjs are integers, k > 0. Define g(x) = 
U?- i( x _ j)- Defining / and F as in the proof for m, so 
that a = 1, we have 

F(x)-e x F{ 0) = - f xe (l ~ U>x f(ux)du. 

J o 

For each j, substitute in x =j and multiply by by, 
summing the results we obtain 



The term on the right is treated exactly as in the proof for 
i t, in order to show that it can be made arbitrarily small 
for p a sufficiently large prime. The term kF( 0) is also 
treated as earlier, it not being divisible by p for any prime 
greater than or equal to both k and ml. Thus, it suffices to 
show that i b j F U) * s divisible by p, and here the proof 
is much simpler than in the case of m. In particular, 

m m s + p m s+p 

2 b jF(j) = 2 bjl f r \j) = 2^2 f r) U), 

j - 1 J- 1 r-0 y=l r ~P 

since lower order derivatives of / have a factor g, which is 
0 at each j. But for r > p, each coefficient of ( p - I)!- 
/ (r) (x) is divisible by p\, and so each coefficient of f (r \x) 
is divisible by p. Plugging in integral values of x, namely 
x = j, 1 < j < m, yields for the sum an integer divisible 
by p. As noted earlier, this completes the argument. 
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Section 2.1 

1. Prove the division algorithm for Z: If m, n G Z, n 
# 0, then there exist unique integers q and r, with 
0 < r < |n|, such that m = nq + r. 

If n > 0, the value q = [m/n ], where the brackets 
denote the greatest integer < m/n, certainly works, for 
then r = m - nq is within the required interval. To show 
this, we first observe that m - nq = m - n[m/ n] > m — 
n(m / n) — 0. But also, m — nq = m — n[m/n\ < m — 
n((m/ n) — 1) = n, since [m/n] > (m/n) - 1. To show 
uniqueness of q and r, suppose that m = nq x + r, and that 
m = nq 2 + r 2 , with both r, and r 2 within the required 
bounds. Then n(q x - q 2 ) = r 2 — r,, so that if q , ^ q 2 , r 2 
and r, are at least n units apart, which is impossible since 
they are both within the interval from 0 to n - 1 . Hence 
q x = q 2 , from which it follows that r, = r 2 . 

Now, for n < 0, by the above there are unique integers 
q and r such that m = (- n)q + r, with 0 < r < |n|. Simply 
take q = —q. 

2. Prove the Euclidean algorithm for L: If m, n G Z, 
n ± 0, then there exist integers s and t such that the g.c.d. 
(m, n) = sm + in. Moreover, the g.c.d. can be obtained in a 
finite number of steps by the method of repeated division of 
the most recent divisor by the most recent remainder, 
beginning with the division of m by n according to the 
division algorithm, as outlined in the text just prior to 
Theorem 13. 

By the division algorithm, the repeated division is 
certainly always possible. Since each remainder is at least 
one unit closer to 0 than the previous one, the process 
terminates (i.e., gives a 0 remainder) after at most n 
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divisions. Thus we have: 

m = nq^ + r, 

?i = r \Ri + r 2 
r \ = r 2 q 3 + r 3 


r N- 2 ~ r N- 1 Rn + r N 
r N— 1 = r sRN+ I O' 

Exactly as in the text, r N can be written in the form 
sm + tn, from which we see that (m, ri) divides r N . But, 
beginning with the last equation and working backwards, 
we see that r N divides r N _ x , r N _ 2 ,...,q v n, and m. 
Hence r N divides (m, n). Thus we conclude that 
r N = (w, n). 

3. Find the g.c.d. of 264 and 714. Can you take them 
in either order ? 

Yes, you can take them in either order, as the Euclidean 
algorithm makes no assumption about their relative size. 
Let us verify this here by doing the computations 
beginning both ways: 264 = 714 • 0 + 264, 714 = 264 • 2 + 
186, 264= 186-1 + 78, 186 = 78-2 + 30, 78 = 30-2 + 
18, 30 = 18 • 1 + 12, 18=12-1 + 6, 12 = 6-2 + 0. If we 
begin with 714 = 264 • 2 + 186, we see that this is just the 
second step in the above sequence. We find that the g.c.d. 
is 6. 


4. Use the Euclidean algorithm for Q[x] to find a 
g.c.d. of f(x) = 2jc 6 - 10x 5 + 2x 4 - lx 3 - 15x 2 + 3x - 15 
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By ordinary long division of polynomials, we find that 
f — g’ (2x - 2) + (5x 3 — 25x 2 + 5x - 25), g = (5.x 3 — 
25x 2 + 5x — 25) • (x 2 /5 + x/5 + 1 /5), so that (/, g) = 
5x 3 - 25x 2 + 5x - 25. 

5. Show that the g.c.d. of two polynomials is 
independent of the field over which we consider them to be. 
That is, suppose F, and F 2 are fields and f and g are 
polynomials belonging to both F,[x] and F 2 [x]; show that 
they have the same g.c.d. in both F,[x] and F 2 [x], (Note: 
Some divisibility properties, such as irreducibility, are 
closely related to the field in question, but the g.c.d. is seen 
here not to be.) 

Let F 3 = F, n F 2 . The application of the Euclidean 
algorithm in either F,[x] or F 2 [x] is equivalent to its 
application in F 3 [x], so the same result is gotten in both 
cases. 

6. Let f, g E F[x], / irreducible. Suppose there exists 
some a EC such that f(a) — g(a) = 0; that is, they have a 
common root in C. Show that f divides g (in F[x]). 

Their g.c.d. is nonconstant, since in C[x] it is divisible 
by (x - a). Since / is irreducible, their g.c.d. must be / 
itself. Hence / 1 g. 

1. Let f, g,h E F[x], / # 0 (i.e., for emphasis, f is 
not the 0 polynomial, although it may be 0 for certain values 
of x). If fg = fh, show that g = h. 

By the division algorithm, the quotient in dividing fg or 
fh by / is unique. Hence g = h. 

8. Show that (x n - l)/(x - 1) is a polynomial over 
Q for all (positive integers) n, and determine precisely the 
set of values of n for which it is irreducible. 
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Since 1 is a root of (x n - 1), (x — 1) divides (x" - 1) in 
C[x] by Problem 1, Section 1.4. But since (x - 1) and 
(x" - 1) are both in Q[x], the quotient will also be 
in Q[xj. In fact, we easily see that (x" - l)/(x — 1) = 
x" -1 + x"~ 2 + • • • + x + 1. If n is prime, this is 
irreducible, for the substitution x = u + 1 transforms it to 
the form 


[(u+ 1)"- l]/u = 



+ 



2 


+ • 

■■ +1 


= u n ~ 

,+ (? 

) Mn 2 + (2)“" 3 

+ ■ 

. i 


• • +1 

In - l)’ 



/« 


in which each coefficient, except the leading one, is 
divisible by n. To see the latter, we write out 

( n ) = 'll 

\k) *!(«-*)! ’ 


which is integral; since n is prime, [ k\(n - &)!] | ( n - 1)!, 
so that n divides (”). The last term is just n itself, so it is 
clearly not divisible by n 2 . By the Eisenstein criterion, the 
polynomial in u, and hence that in x, is irreducible. If n is 
not prime, in which case we can write n = jk, we have 

x” - 1 = x n - 1 _ x* - 1 
x - 1 x*- 1 x- 1 


= [(x*y 1 + (x*y 2 + • • • + 1] 

x[x*-* + x*- 2 + • • • + 1 ]. 
which shows that for such n, the polynomial is reducible. 
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9. Decompose x 10 — 1 into a product of irreducible 
factors in Q[x]. 

First we use the fact that it is a difference of squares, 
and then we apply the result of the previous problem, thus 
obtaining 

x 10 - 1 = (x 5 + l)(x 4 + x 3 + x 2 + x + l)(x - 1). 

But since - 1 is a root of x 5 + 1, x + 1 must divide x 5 + 1, 
so we further obtain 

x 10 - 1 = (x 4 - x 3 + x 2 - x + l)(x + 1) 

X(x 4 +x 3 + x 2 + x+l)(x- 1). 

The last three factors are already known to be irreducible. 
The irreducibility of the first factor may be established by 
the same method as in the previous problem, or from that 
result by the change of variables v = - x. 

10. Let f be a polynomial irreducible over F. Show that 
f has no multiple roots. ( Hint : Consider the g.c.d. of f and 
its derivative.) 

If a were a multiple root of /, then by the definition of 
multiplicity we could write / = (x - a) 2 g, for some g e 
C[x]. Hence f\a) = 0, since /' = (x - a) 2 g' + 2(x - a)g. 
Hence, (/, /') has (x - a) as a factor in C[x], and thus its 
degree is >1. But the g.c.d. divides / in F[x]. This is 
impossible, since its degree is also < deg /' < deg /, and / 
is irreducible. 

11. A polynomial with integral coefficients is said to be 
primitive if its coefficients are relatively prime ( that is, 
there is no integer other than ± 1 which divides all the 
coefficients). Iff is any polynomial with integral coefficients 
and if g is a primitive polynomial which divides f, show that 
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the quotient f/g actually has integral coefficients. (As an 
application of this, observe that if a monic polynomial with 
integral coefficients divides another polynomial with integral 
coefficients, then the quotient has integral coefficients.) 

This follows from the proof of Lemma 12. In particular, 
using the notation of that proof, the fact that g has 
integral coefficients implies that 5 = 1, and the fact that 
the coefficients are relatively prime implies that A = 1. 
Thus, since E = ± 1, it follows that / = gh = ±g[(D/C) 
h], and so the polynomial (D/C)h, which has integral 
coefficients, must be ± h itself. 

The application of this follows from the observation 
that a monic polynomial with integral coefficients is 
primitive. 

12. If f is a polynomial with integral coefficients and if 
g is a monic polynomial with integral coefficients, show that 
the quotient q and the remainder r, resulting from the 
division of f by g according to the division algorithm, both 
have integral coefficients. 

This additional property may be carried through in the 
induction argument used to prove the division algorithm. 
In particular, in that proof, b m = 1, so that the reduced 
polynomial / - (a„/ b m )x"~ m g also has integral co- 
efficients. 

*13. Develop an algorithm for factoring any polynomial 
in Q[jc] into irreducible factors. 

Given a polynomial / G Q[jc], we give a procedure for 
finding a nontrivial factor, if it has one. By division, we 
can then write / as a product, and this essentially com- 
pletes the problem, for the procedure can then be reap- 
plied to each of the resulting factors. To begin, observe 
that multiplying through by an appropriate integer, we 
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may transform / into a polynomial with integral coef- 
ficients; and since the factorization of this new poly- 
nomial is essentially equivalent to that of the original one, 
we assume without loss of generality that / already has 
integral coefficients. Let n denote the degree of /. We seek 
a polynomial g(x) = '2/£ =0 a k x k such that g \ f, a m + 0, 
and 1 < m < n — 1. By Gauss’ Lemma, it suffices to 
consider the case where g and f/g have integral 
coefficients. 

By trial and error, pick n + 1 integers * 0 , x x , . . . , x„ 
such that none of them is a root of /. (This involves trying 
at most 2 n + 1 integers, since / has at most n distinct 
roots.) Since g and / have integral coefficients, for every j, 
g(xj ) and f(Xj) are themselves integers. Since f/g has 
integral coefficients, g{xf) | f{xf) in Z. Since each f(xf) is a 
known integer, it has only a finite number of factors, 
which we are able to list. To put this last fact slightly 
differently, there are only a finite (and known) set of 
n-tuples (/' 0 , . . . , <„) such that for every j, ij \ f(x). 

Now, for each such n-tuple there is a unique polynomial g 
of degree < n such that for every j, g(xf) = ij. To find it, 
write g(x) temporarily allowing m — n, and 

solve the simultaneous equations g(xj) = ip 0 < j < n, for 
the coefficients, the a k ’s. The determinant of the system is 

1 x 0 xl ... xS 

1 *i x] ... x” 

1 X. xl ... X " 

n n n 

which is a Vandermonde determinant, and nonsingular 
since the xfs are distinct. Thus there is a unique solution 


= II(x, - Xj), 

•>j 
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for the a k ' s. If a n # 0, or if only a 0 =/= 0, the resulting g is 
not a relevant candidate, since its degree would be n or 0; 
and in this case it would be discarded. In each other case, 
the resulting g may be divided into / according to the 
division algorithm to see whether the remainder is 0. Once 
a divisor of / is obtained, the procedure is repeated on the 
resulting factors of /. If / has a divisor, one must be 
obtained by this procedure. Thus, if no divisor is ob- 
tained, one can conclude that / is irreducible. 

(This algorithm is primarily of theoretical interest; it 
would be very time-consuming to carry it out in most 
cases.) 

Section 2.2 

1. If E is an extension of F, verify that E may be 
considered as a vector space over F. (A set V is said to be a 
vector space over a field F if it has the following properties', 
(a) there is a rule for adding vectors, always resulting in an 
element of V; ( b ) this addition operation is associative and 
commutative', (c) there is a vector called 0, such that 
v + 0 = v for every v; (d) for every v there is a vector 
called -v, such that u + ( — u) = 0; (e) there is a rule, 
called scalar multiplication, for multiplying a scalar times a 
vector, always resulting in an element of V; (/) for a, b £ F 
and u, v E V, ( ab)v = a(bv), a(u + v)= au + av, and 
(a + b)v = av + bv, (g) for every v E V, 1 v = v, where 1 
is simply the number 1 in F. 

Every one of the properties follows immediately from 
the definition of a field and the properties of complex 
addition and multiplication. 

2. Show that if E is a quadratic extension of F, then 
[E : F] = 2. 
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By the definition of quadratic extension, E = F(VF ), 
where k E F but jk ^F. But \/k is a root of an irreducible 
polynomial of degree 2 over F, namely x 2 - k. It is 
irreducible because if it factored over F, the factors would 
be linear, and this is impossible since neither of its roots, 
±fk , are in F. By Theorem 17, [E : F] = 2. 

3. Let E be an extension of F. Show that [E : F] = 1 
if and only if E = F. 

If E = F, then the single number 1 is a basis for E over 
F, and so [E : F] = 1. If E # F, let a be some element in 
E but not in F. Then the numbers a and 1 are linearly 
independent over F, so that [E : F] > 2. (Of course, [E : F] 
may be infinite.) 

4. If E is an extension of F such that every element of 
E is algebraic over F, we say that E is algebraic over F. Is 
every algebraic extension a finite extension ? 

No; for example, let E be the set of all algebraic 
numbers and let F = Q. Then for any n, the n algebraic 
numbers 1 , 2 1/n , 2 2/n , . . . , 2 (n ~ l)/ " are linearly indepen- 
dent, since deg2 1 /" = /j by the irreducibility of x n - 2 
over Q. Thus E contains linearly independent sets with 
arbitrarily many elements. 

5. If E is an algebraic extension of K, and K is an 
algebraic extension of F, show that E is an algebraic 
extension of F. ( See the previous problem for the definition 
of an algebraic extension.) 

Pick an arbitrary a G E. We want to show that a is 
algebraic over F. Now, a is a root of some polynomial / 
over K, since E is algebraic over K. Let the coefficients of 
this polynomial be a n , . , o 0 . Thus, a is actually 
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algebraic over K = F(a„, a n _ h ... , a 0 ), which is a 
multiple algebraic extension of F and hence a finite 
extension of F. By Theorem 19, [K(a): F] = [K(a) : K] • 
[K : F], which is finite. Since a is in some finite extension 
of F, by the first part of the proof of Theorem 18, a is 
algebraic over F. 

6. Prove Theorem 8 by using the ideas of this section. 
( Hint : Observe that if a and b are algebraic over F, then 
F(a, b) is a finite extension of F.) 

The observation in the hint follows immediately from 
Theorem 18. Let c denote any of the numbers a + b, 
a — b, ab, or a / b (as long as b ^ 0). Since c £ F(a, b), we 
know that F c F(c) c F(a, b) and so deg F c = [F(c) : F] is 
finite by Theorem 19. By Theorem 17, c must then be 
algebraic over F. Alternatively, apply to c the first 
observation in the proof of Theorem 18. 

7. Express Q(f2 ,f3 ) as a simple extension of Q. 
Find the degree of this extension over Q. 

Q(V2 , V3 ) = Q(V2 +'J3 ). It obviously suffices to show 
that V2_and V3 each belong to Q(^2 +fi). Define a = 
V2 +V3 . Then 


a 2 = 5 + 2f6 

| ) a =v riy =2V3 +3 V2 
)a-2a=V2 


3a-(^-^)n=V3 
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It was_shown in Problems 1 and 2 of Section 1.4 that 
deg Q (V2 +V3 ) = 4, so this is the degree of the extension. 

8. Let f be a polynomial of degree n over F. Show 
that there exists an extension E of ¥ in which f is completely 
reducible (i.e., can be factored into linear factors) and such 
that [E : F] < n\. 

Let the roots of / (in C) be a 2 , . . . , a n . Define 
F, = F(a,), F 2 = F,(a 2 ), • • • , F„ = F„_ ,(«„)• [F, : F] < n, 
since deg F a, < n. Now factor x — a, out of / in F,, so 
that a 2 satisfies a polynomial of degree n - 1 over F,. 
Thus deg F a 2 = [F 2 : F,] < n — 1. Continue in this way 
and then apply Theorem 19, to obtain [F„ : F] = 
[F„ : F„_j] • [F„_, : F„_ 2 ]- • - [F, : F] < (1X2)- • • (n - 1)- 
(«)=«! 

9. Let a be transcendental over F. Show that the set 
{a k | k G Z} is linearly independent in F(o) over F. Is it a 
basis for F(a) over F? 

To show linear independence, suppose we have a linear 
combination of elements a k equal to 0. Multiplying 
through by the highest power of a in any denominator, we 
obtain a as a root of a polynomial over F. Since a is 
transcendental over F, it must be the 0 polynomial; thus 
the linear combination we started with must have all 0 
coefficients. 

This set is not a basis, since it does not span F(o). For 
example, 1 /(a + 1) cannot be written as a linear 
combination of the proposed elements. For if it could, 
there would be an n > 0 such that 


1 

a + 1 


2 c k ak > c k E f. 


k = —n 


But then we could multiply both sides by (a + \)a n to get 
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a as the root of a nontrivial polynomial over F, which 
would contradict its transcendence over F. 

10. Let f 6 R[x] with deg/ > 3. Show that f is 
reducible. 

Let a be any root of /. If / were irreducible, then 
deg R a = deg f > 3. However, 2 = [C : R] = [C : R(a)] 
[R(d) : R], so that [R(a) : R] = deg R a =1 or 2. (An 
alternate proof would be to establish first that if a is a 
root of a polynomial over R, then so too is a, its complex 
conjugate. When / is completely factored in C[x], the 
factors corresponding to complex conjugate roots may be 
combined to give quadratic factors over R. Thus the 
irreducible factors of / over R are linear and quadratic 
only. See the next problem.) 

1 1 . Use the result of the previous problem to show that 
forf GR[x], a is a root if and only if its complex conjugate 
a is a root. ( The complex conjugate of b + ci is b - ci, 
where b, c G R.) 

Since a — a, it suffices to show the “only if” part of the 
statement. If a is real, we are done, since then a = a. If a 
is not real, then by Problem 10 it must be the root of a 
real quadratic factor of /, say Ax 2 + Bx + C. By the 
quadratic formula, the roots of this factor are complex 
conjugates (since they are not real), and so they are a and 
a. Since this quadratic divides /, a is also a root of /. 

*12. If E is a finite extension of F, show that there are 
only a finite number of intermediate fields K, E D K D F. 
( This may be surprising.) 

By Theorem 18, we may write E = F(a) for some a 
algebraic over F. Let / be a minimal polynomial for a over 
F. Since a is certainly algebraic over any intermediate 
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field K, for each such field there exists a unique monic 
minimal polynomial g for a. By Problem 6 of Section 2.1, 
g divides /. Furthermore, if K, and K 2 are distinct 
intermediate fields and g t and g 2 the corresponding 
minimal polynomials for a, we will show that g ] # g 2 . For 
if g, = g 2 , define K to be the multiple (algebraic) extension 
of F by the coefficients of and g 2 . Since deg K a = deg K a 
and E = K(a) = K,(a), by Theorem 17, [E : K] = [E : K,']. 
By Theorem 19 then, [K : F] = [K, : F]. Since in addition 
k,dk, we can conclude that K = K,. Similarly, K = K 2 , 
and so K, = K 2 . 

Consequently, distinct intermediate fields have distinct 
g’s. But / obviously has only a finite number of monic 
factors in E[x], and so there can only be a finite number 
of intermediate fields. 

13. Suppose that E = F(r), where r is a root of an 
irreducible polynomial f over F. If r is another root of f and 
ifrE E, show that E = F(r). 

Since r E E, we can write F c F(?) c E. Since deg F r 
= deg F r, Theorem 17 implies that [E : F] = [F(r) : F] 
= [F(r) : F], By Theorem 18, [E : F] = [E : F(r)][F(r) : F], 
and so [E : F(r)] = 1. By Problem 3, E = F(r). 

Section 23 

1. Use the techniques of this section to show that it is 
not possible to divide an arbitrary angle into five equal parts 
by the usual rules of construction. 

The relevant equation was derived in Problem 3 of 
Section 1.3. With x = 2 cos 9, we have x 5 - 5x 3 + 5x — 
2 cos 56 = 0. If we take 59 such that cos 50 = 5/6, which 
angle is constructible since 5/6 E Q, we see that the 
resulting polynomial is irreducible. (Multiply through by 3 
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and apply the Eisenstein criterion with p = 5.) Thus 
2 cos 9, being a root of this, is not constructible. But if 
this particular constructible 59 were able to be divided 
into five equal parts, cos 0 and hence 2 cos 9 would have 
to be constructible. 

*2. Let S = {9 | 6 is a trisectible angle). Prove that S is 
countable. {Keep in mind the distinction drawn at the end of 
Section 1.3.) 

Since the set of algebraic numbers is countable, it 
follows that the set of angles 3 9' for which 2 cos 3 9' is 
transcendental has for its complement a countable set. We 
show that for each such 3 9', 3 8' is not trisectible. If it 
were, by an obvious extension of Theorem 1, there would 
have to be a sequence of fields Q(2 cos 39') = F 0 c F, 
C • • • Ct N such that 2 cos 9’ E¥ n and such that each 
Fy +1 is a quadratic extension of F,. Letting a = 2 cos 39', 
at least one root of x 3 - 3 x - a, namely 2 cos 8', would 
have to have some power of 2 for its degree over Q(a). 
For this to happen, x 3 — 3x - a would have to be 
reducible over Q(a); otherwise each of its roots would 
have degree 3 over Q(a). But if x 3 - 3x - a is reducible 
over Q (a), at least one of its factors must be linear, and so 
it would have a root in Q(a). Since every element in Q(a) 
has the form f{a)/g{a) where / and g are polynomials 
over Q evaluated at a, we would thus have for some / and 
g 

[f( a )/g( a )Y~ 3 [K a )/g( a )] ~ a = °, 

and hence 

U( a )f- 3 [/(«)][ g(«)] 2 - 4 g(a)] 3 =0- 

This would give a as a root of the polynomial 
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/ 3 - 3 fg 2 - xg 3 e Q[x], a contradiction as long as we can 
show that this polynomial is not the 0 polynomial. Let 
n = deg / and m = deg g. (Obviously, neither can be the 0 
polynomial.) The three terms have respective degrees 
3 n, n + 2m, and 3m + 1. If n > m, there is a single term 
of highest degree 3n. If n < m, there is a single term of 
highest degree 3 m + 1. In either case, the terms cannot all 
cancel and so the polynomial is nontrivial. 

Section 2.4 

1. Determine the cube roots of unity by a geometrical 
analysis. 

Dividing the unit circle into three equal parts, beginning 
at (1, 0), gives for the other two points (cos 120°, sin 120°) 
and (cos 240°, sin 240°). Evaluating these trigonomet- 
ric functions, we get the points (-1/2, +f3 / 2) and 
( — 1 /2, — /3 /2), which correspond to the complex 
numbers - 1/2 + iff /2 and — 1 /2 — if . 3 /2. 

2. Evaluate <#>(£:) for k — 1, 2, ... , 15. 

<#>(1) = 1, <#>( 2) = 1, *3) = 2, <#>(4) = 2, <K 5) = 4, <f>( 6) = 2, 
<f>(7) = 6, <K 8) = 4, <#<9) = 6, <K10) = 4, <>(1 1) = 10, <#<12) 
= 4, <#>(13) = 12, <#>(14) = 6, <#<15) = 8. 

3. Show that <#>(/>) = p — 1 for p a prime. 

Since p is prime, every number from 1 through p — 1 is 
relatively prime to p. 

4. Determine <#>( p 2 ) for p a prime and n a positive 
integer. 

Of the p n numbers from 1 to p n , only p, 2 p, . . . ,p n ~ l p 
are not relatively prime to p. Since there are p n ~ ] of these 
numbers, <#>(/?") = p" - p "~ 1 = p n ~'(p — 1). 
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5. Would it make sense to discuss “ primitive ” nth 
roots of 2? 

No. If z m = 2, 1 < m < n, then \z\ m = 2, which for a 
fixed z can only happen for m = n. Thus although every 
root might be termed “primitive”, because of this fact the 
term would lose any usefulness. Moreover, the basic 
properties of primitive roots of unity would not carry 
over. 

6. How many primitive 20 th roots of unity are therel 

Eight, since <J>(20) = 8. 

7. Describe the location of the seventh roots of 1 — i 
in the complex plane. 

An argument of 1 - i is —m/ 4. Take one seventh of 
this to get an argument of one root. Since |1 - /| = \/2 , the 
modulus of any root is fl . Thus the roots are equally 
spaced on the circle x 2 + y 1 = f2 , beginning with the 
point corresponding to 0 = —m/ 28. 

8. Show that if a is an nth root of unity , then a is a 
primitive mth root of unity for some m such that m \ n. 

It is obvious that a is a primitive mth root of unity for 
some m such that 1 < m < n. Just let m be the smallest 
positive integral exponent such that a m = 1. To show that 
m | n, use the division algorithm for Z to write 
n — mq + r, where 0 < r < m. We simply want to show 
that r = 0, which we do by showing that a r = 1. To do 
this, we calculate a r = a n ~ m i = a" / a mq = l/(a m ) ? = 1/1 
= 1 . 

9. Show that 2<#| n^(^) = ”> where the sum is over all 
positive integers d which divide n. Verify this result by 
direct calculation for the case n — 12. 
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Both sides represent the number of «th roots of unity. 
For the right side this is obvious. For the left, this follows 
from the previous problem, noting that the m there is 
unique. For n = 12, </>(l) + <f>(2) + <p(3) + </>(4) + <j>( 6) + 
<J>(12) =1 + 1 + 2 + 2 + 2 + 4= 12. 

10. Determine the degree over Q of the primitive nth 
roots of unity for n = p and n = p 2 , where p is a prime. 

For n = p, they are the roots of (jc^ — 1)/(jc — 1) 
= x p ~ l + x p ~ 2 + • • • + 1, which was shown in Problem 
8 of Section 2.1 to be irreducible over Q. Hence in this 
case they have degree p — 1. 

For n—p 2 , they are the roots of (x p2 - \)/{x p - 1) 
— x p(p ~ >) + x p(p-i) + . . . +1, This equation is irreduc- 
ible, as can be seen by means of the change of variable 
x - u + 1. For, 

2 (« + iy ,(/ ’ _ * ) = 2 [(u + \yy- k 

k - I A = I 

= 2 + 1 +pg(“)Y~ k 

k = 1 

- t [K+1 y- k +ph k (u)] 

k = 1 

where each h k (u) is a polynomial of degree p 2 — pk — 1 
with integral coefficients. Using the formula for the sum of 
a finite geometric progression, 


2 (u + 1 y (p ~ k) 

A- 1 


2 ( U p + 1 /-* + pH(u) 


A = 1 


( u p + 1 y - 1 


+pH(u) 


= u p(p - l) + pG(u) 
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for some polynomial G of degree p 2 - p - 1. To apply the 
Eisenstein criterion, it simply remains to show that the 
constant term of the entire expression is not divisible by 
p 2 . Although we could trace this value through the compu- 
tations, it is easier just to plug u = 0 into the original sum, 
which gives the constant term = p. Now by the Eisenstein 
criterion the polynomial in u, and hence in x, is irreduci- 
ble. Therefore the degree over Q of the primitive p 2 roots 
of unity is p(p -1). 

11. If k and n are relatively prime positive integers, 
show that 2/* = 0, where the r/s are the n distinct nth 
roots of unity. 

The numbers r* are obviously roots of unity themselves, 
since (r*) n = (/•")* = 1* = 1. They are also distinct, for if 
r* - r k , then (rj rj) k - 1. This combined with (rj rff = 1 
would make r,/r y a primitive mth root of unity for some m 
dividing both k and n. Hence m * 1 and r, = r Jt Therefore 
2^* which equals the negative of the coefficient of 
x n ~ l in the polynomial Jt n -1. (Section 1.5.) This 
coefficient is 0. 

*12. Determine the degree over Q of each of the 
primitive 20th roots of unity. ( This is not the same question 
as in Problem 6.) 

The degree is 8. We begin by factoring x 20 - 1 = (x 10 - 
l)(x 10 + 1); the primitive 20th roots must be roots of the 
second factor. Further, x 10 + 1 «® (x 2 + l)(x 8 - x 6 + x 4 - 
x 2 + 1), from which it is clear that the primitive 20th 
roots, of which there are precisely 8 (Problem 6), are the 
roots of x 8 - x 6 -I- x 4 - x 2 + 1. However, we don’t yet 
know whether this can be factored further over Q, and the 
Eisenstein criterion does not apply even with the kinds of 
variable changes used in Section 2.1 and in Problem 10. 
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Since any primitive 20th root of unity may be written as 
a power of any other, they all have the same degree over 
Q and in fact Q(o>,) = Q(to 2 )- Consequently, if x 8 - x 6 + 
x* — x 2 + 1 can be factored in Q[x], its irreducible factors 
must all have the same degree. Hence there are either 8 
linear factors, 4 irreducible quadratic factors, or 2 
irreducible quartic factors. In any of these cases, this 
polynomial can certainly be written as the product of 2 
quartic factors (although not necessarily irreducible ones). 
That is, we can write 

x 8 - x 6 + x 4 - x 2 + 1 

= [ax 4 + bx 3 + cx 2 + dx + e] 

X [Ax 4 + Bx ? + Cx 2 + Dx + £] 

for some rational and, by the Lemma of Gauss, integral 
coefficients a, b, c, d, e, A, B, C, D, E. By comparing 
corresponding coefficients we shall be able to obtain 
a contradiction, thus showing that the primitive 20th roots 
of unity have degree 8 over Q. In particular, taking 
a = A = 1 without loss of generality, we can conclude 
rather easily that b = - B,d — - D, c = C, and e = E 
— ± 1. Further, from — 1 = 2C — JS 2 it follows that B is 
odd and that, since 2C = (B - l)(B + 1), C is even. At 
the same time, however, from 1 = 2£ - 2i?D + C 2 it 
follows that C is odd. This gives the desired contradiction. 

(It turns out that the <p(n) primitive nth roots of unity 
always satisfy a single irreducible polynomial over Q but 
it is rather difficult to prove this. See Problem 5 of Section 
2.7.) 

*13. Show that the degree over Q of each of the 
primitive nth roots of unity when n = p m , a power of a 
prime, is exactly <p(n). 
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The roots in question are precisely the roots of the 
polynomial 

/(*) = x Cp~2)p-' 


+ • • • + x p +1 

whose degree is <p(n). (Cf. Problem 4.) Thus it suffices to 
show that / is irreducible over Q. Note that /( 1) = p. If 
there were a nontrivial factorization f(x) = g(x)h(x), by 
the Lemma of Gauss we could assume that g and h had 
integral coefficients. In this case, when x — 1, one of them 
has value ± 1 and the other ± p. In fact, without loss of 
generality we may assume that g(l) = 1. Now g has at 
least one root, which must be a primitive nth root of unity. 
Call it w. The powers of u from u 1 tou"" 1 must include 
all the primitive nth roots of unity, along with other nth 
roots of unity. Therefore the polynomial 


contains among its roots all the roots of /, and thus is 
divisible by /. So we can write 


G(x) 

/(*) 


= q(x) 


for some polynomial q. Since / is primitive. Problem 1 1 of 
Section 2. 1 implies that q has integral coefficients, so that 
^(1) must be an integer. However, we find that 

m C(l) i 

q{ ) /(l) P ’ 


a contradiction which shows that / could not have been 
reducible. (The result of this problem holds for all n, not 
just powers of a prime. See Problem 5, Section 2.7.) 
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Section 2.5 

1 . Give a proof of the multiplication and division parts 
of Lemma 22 a using the polar representation of a, and a 2 . 

If a, = \a\\e‘ e ' and a 2 = \a 2 \e‘ 01 , then a { a 2 = 
|a,| \a 2 \e‘ < - e ' +9 ^, whose modulus is constructible since it is 
the product of constructible numbers and whose argument 
is constructible since it is obviously possible to add two 
angles. Since aj a 2 = (\a { \/\a 2 \)e' (9 '~ 9:i) , the argument 
here is very similar. 

2. Give a proof of Lemma 22 b which does not use the 
polar representation of a. 

Letting a = b + ic and k = r + is, we simply solve for r 
and s in terms of h and c. We have k 1 = (r + is)(r + is) 
= (r 2 - s 2 ) + i(lrs) = b + ic, so that r 1 — s 2 = b and 
2 rs = c. Hence r = c/2s, so that c 2 - 4s 4 = 4bs 2 , from 
which we can solve for s by means of two quadratic 
equations, the steps therei n corre sponding to admissible 
constructions. Since r = ±\s 2 + b , r is also constructible. 

3. Give an analytic proof of Lemma 2Aa. 

If w is a primitive nth root of unity, then for n = mk, w k 
is easily seen to be a primitive mth root. Since by the 
hypothesis co is constructible, so too is w*. 

4. Prove that the regular pentagon is constructible. 

The primitive fifth roots of unity satisfy the irreducible 
polynomial equation (jc 5 - l)/(x - 1) = x 4 + x 3 + x 2 + 
x + 1 = 0. This is a “reciprocal equation”, meaning that 
the reciprocal of any root is also a root; for, dividing by 
x 4 , we obtain 1 + (1 / x) + (1/x) 2 + (1 /x) 3 + (1/ x) 4 = 0. 
In such cases the change of variables u = x + ( 1/x) 
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halves the degree (which must be even to start or else the 
equation would be reducible, having +1 or - 1 for a 
solution). Dividing the original by x 2 , we have 

x 2 + x + 1 + x _I + x -2 = 0. 

The substitutions, 

u = x + x~\ 
u 2 = x 2 + 2 + x~ 2 , 

lead to u 2 + u - 1 = 0. Thus u can be obtained by the 
solution of a quadratic, and from u, x can be obtained 
likewise. Hence the roots are constructible. (By carrying 
through the computations, one could actually describe the 
construction.) 

5. Is it possible to divide an angle of 60° into five equal 
parts ' ? 

Yes. From the previous problem 72° = 360°/5 is 
constructible, and 12° = 72° - 60°. 

Section 2.6 

1. Use congruences to reformulate and prove the 
theorem that every perfect square is of the form 4 n or 
4« + 1. 

For any a, a = 0, 1, 2, or 3 (mod 4). In these cases 
a 2 = 0, 1,0, 1 (mod 4) respectively, which is equivalent to 
the theorem. 

2. If a ^ 0 (mod n) and b ^ 0 (mod n), does it follow 
that ab ^ 0 (mod /?)? 

No, 3 ^ 0 (mod 6) and 2^0 (mod 6), but 6 = 0 (mod 
6). This is equivalent to the fact that n may divide a 
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product of two numbers without dividing either one. Of 
course, if n is prime, this can no longer happen. 

3. If a and n are relatively prime, show that there exists 
an integer b such that ab = 1 (mod n). 

The number b = a ^ n) ~ 1 will do, by Fermat’s Theorem. 
For a direct proof, note that in the list a, a 2 , . . . , a n+ ', 
some two entries must be congruent, for each is congruent 
to one of the n numbers 0, 1, . . . , n — 1. But if a j = a k 
(mod «), with, say, k > j, then 1 = a k ~ J = a ■ a k ~ J ~ 1 (mod 
n), so that we can take b = a k ~ J ~ ] . 

4. With the obvious definition, does there always exist a 
primitive root modulo n, when n is not prime ? 

No, and n = 8 is the smallest counterexample. In 
general, we would want a to have order <j>(n) modulo n. 
Here we have </>(8) = 4, but for every a, 1 < a < 7, with 
(a, 8) = 1, there is a lower power congruent to 1. The a' s 
are: 1, 3, 5, and 7; and by computation we find that 
1' = 3 2 = 5 2 = 7 2 ee 1 (mod 8). 

In number theory it is shown that the only numbers n 
having primitive roots modulo n are those of the form p k , 
2 p k , 2, and 4, where p is an odd prime. 

5. Find all the primitive roots modulo 1. 

We are looking for numbers of order <#>(7) = 6 modulo 
7. By Theorem 27 there are <f>(p — 1) = <J>(6) = 2 of them. 
By trial and error we find two such to be 3 and 5. As 
alternatives, of course, we could take any numbers 
congruent to these modulo 7. 

6. How many primitive roots modulo 17 are there ? Find 
one. 

There are <f>06) = 8 of them. They are: 3, 5, 6, 7, 10, 1 1, 
12, and 14. 
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Section 2.7 

1. Carry through the proof of Theorem 28 for the case 
p = 5, and deduce from it an algorithm for the construction 
of the regular pentagon. 

Let to be a primitive fifth root of unity. A primitive root 
modulo 5 is g = 2, and so we can list the four primitive 
fifth roots of unity as: to 2 , to 4 , to 3 , to 1 . Taking in particular 
to = e 2 "'/ 5 , these are sketched in Figure 17. Proceeding as 
in the proof, 17 , = to 2 -I- to 3 and tj 2 = to 4 + to 1 , so that 
Vi + V 2 — — 1 and = — 1. Thus tj, and tj 2 are the roots 
of x 2 + x - 1. Since, as is clear from the diagram, tj 2 > 17 ,, 
we have tj, = (- 1 — )/2 and i ) 2 = (- 1 +V5" )/2. Two 

of the 2-periods are just to 4 and to 1 . Their sum is tj 2 and 
their product is 1. Thus we see that they are the roots of 



Figure 17 
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x 2 - t] 2 x + 1> from which it follows that 

m+ivi ~ 4 

to = . 

2 

Thus the algorithm is to construct successively tj 2 and then 
co. 

2. Generalize the method of this section to show that 
the primitive seventh roots of unity may be obtained by the 
successive solution of polynomials of degrees 2 and 3. (You 
are not asked to solve these lower degree polynomials .) 

The number g = 3 is a primitive root modulo 7. Letting 
co be a primitive seventh root of unity, we can list all the 
primitive roots in the usual way as co 3 , co 2 , co 6 , co 4 , co 5 , co 1 . 
We break this set into three groups, taking every third 
one, and thus obtain periods 

T), = to 3 + CO 4 , 

7) 2 = CO 2 + CO 5 , 

7] 3 = W 6 + CO 1 . 

We shall find a cubic with these three periods as its roots 
by evaluating the three elementary symmetric functions of 
three variables at these values. That is, 

Vi + V2 + ^3 = “ 1 

ViVi + V\Vs + V2V3 = [w 5 + w 1 + co 6 + co 2 ] 

+ [co 2 + co 4 + co 3 + co 5 ] 

+ [<o' + co 3 + co 4 + co 6 ] 

= -2 

V1V2V3 = co 4 + co° + co 5 + co 1 + co 6 + co 2 + co° + co 3 

= 2 + (-l)=l. 
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Consequently, ij,, r) 2 , and tj 3 are the roots of x 3 + x 2 — 
2x - 1. Now each of these 1-periods may be decomposed 
into two 2-periods, each of which is a power of u. These 
can be found as the solutions of quadratics. For example, 
u 3 + « 4 = i], and (<o 3 )(co 4 ) = 1, so that to 3 and w 4 are the 
roots of x 2 - tj,x + 1. Similarly for the others. 

(Actually, for any prime p, this method reduces the 
solution of the pth cyclotomic polynomial to the solution 
of equations whose degrees are the prime factors of p — 1 . 
It can also be shown that all the resulting equations can 
actually be solved explicitly, but this topic is best 
addressed in the more general context of Chapter 3.) 

3. If a is a primitive mth root of unity and if b is a 
primitive nth root of unity , and if (m, n) — 1, show that ab is 
a primitive mnth root of unity. Use this fact to give an 
alternate proof of Lemma 28 a. 

Since a m - 1 and b n = 1, it is clear that ( ab) mn - 1; we 
simply need to show that mn is the minimal such positive 
exponent. Suppose that for some k > 1, (ab) k - 1; we will 
show that (mn) \ k. Since a k = b~ k , we have 1 = a km 
= b ~ km , and hence, by the same reasoning as in the proof 
of Theorem 21, n | (-km). Since (m, n) - 1, in fact n \ k. 
Similarly, we get m | k. But since (m, n) — 1, it follows 
that (mn) \ k. To complete an alternate proof of Lemma 
28a, we note that there we are essentially given the 
constructibility of a and b. By Lemma 22a, ab is 
constructible, and hence so is the regular mn- gon. 

*4. Find a necessary and sufficient condition on a 
rational number a such that an angle of a degrees is 
constructible. 

If a is written M/N in lowest terms, then for a to be 
constructible it is necessary and sufficient that 3 divide M 
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and that N have the form 2 k p x p 2 - • • p m where the p’s are 
distinct Fermat primes other than 3 or 5. (Of course we 
may have m = 0, in which case N is just a power of 2.) 
The proof consists essentially of an application of 
Theorems 24 and 28 (on the constructibility of regular 
n-gons). Important tools are the Euclidean algorithm and 
the observation that an angle of 3° is constructible (the 
central angle of a pentagon is 72°; subtract 60° and bisect 
twice) but that 1 ° is not (for then the multiple 20° would 
be, which was shown impossible in Section 1.3). The 
degree symbol will be omitted in what follows. 

( Necessity .) If M/N is constructible, then so is M. If 
(A/, 3) = 1, then we could write Mr ] + 3s, = 1 for integers 
r, and j, and so 1 would be constructible, which is 
impossible. Hence 3 | M. For convenience we will write 
M = 3Q; of course (Q, N) = 1. But then Qr 2 + Ns 2 = 1 
for integers r 2 and s 2 , and so 3 Qr 2 /N + 3$ 2 = 3/ N. Since 
the first term on the left is a multiple of a and the second 
is a multiple of 3, we see that 3/N is constructible. 
Therefore the multiple 360/ N is constructible, from which 
Theorem 24 implies that N has the form 2 k p\p 2 • • • p m . It 
remains to show that none of the primes can be 3 or 5. 
The case of 3 is excluded since 3 | Af and (A/, N) — 1. For 
the case 5 1 7S7, it would follow that 3/5 is constructible 
since it is a multiple of 3/N. But then 24(3/5) = 72/5 
would be constructible, which is impossible since this is 
the central angle of a regular 25-gon, which is not 
constructible. 

( Sufficiency .) Let a have the given form. We know 
that 360/(2*- 3 • 5 •/>,••• p m ) is constructible by 
Theorem 28. This is just (3 • 8)/(2 k p x p 2 ■ • • p m ) which can 
be bisected thrice and then multiplied by Q to give the 
angle a. 
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*5. Show that all the <f>(n) primitive nth roots of unity 
are the roots of a single irreducible polynomial over Q which 
is monic and has integral coefficients. Use this result to give 
an alternate proof of Theorem 24. (Hint : This problem does 
not depend specifically on the results of this section. It was 
not included in an earlier section simply so the reader would 
have more experience with some of the material before 
attempting it. Special cases were encountered in the 
problems in Section 2.4.) 

The polynomial x" — 1 can be factored into irreducible 
polynomials; by Gauss’ Lemma they may be taken to 
have integral coefficients, in which case they must also be 
monic. If <o is a primitive nth root of unity, then the factor 
having w as a root will be called /. We want to show that 
this irreducible / has exactly all the <f>(n) primitive nth 
roots of unity as its roots, and thus that it is the nth 
cyclotomic polynomial. 

Consider the polynomial f k (x) = f(x k ), which is also 
monic and has integral coefficients. By the division 
algorithm we can write f k = qj + r k where either r k — 0 or 
deg r k < deg/. In addition, since / is monic, Problem 12 
of Section 2.1 implies that q k and r k have integral 
coefficients. If we do the division with some other /, 
j = k (mod n), the resulting is identical with r k . To see 
this note that (f k - /)(«) = f(w k ) - /(u y ) = 0 and so 
/ 1 (f k — f). Thus the remainder when we divide f k — f by 
/ is 0; but this remainder is simply r k - rj. In conclusion, 
as k ranges through all the positive integers, there are only 
n different remainders r,, r 2 , . . . , r n that we may obtain 
as above. Let M be an integer greater than the absolute 
value of every coefficient in every one of these r,’s. We 
claim that for every prime p > M,f\f p . To see this, look 
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at the polynomial f p ,f raised to the pth power. It is easy to 
see that it has the form f p = f p + pg for some polynomial g 
with integral coefficients, and we can further write 
g = sf + 1 for polynomials s and t with integral 
coefficients, / = 0 or deg t < deg /. But now we have two 
representations of f p :f p = q p f + r p , and = f p - pg 
= f p - psf - pt = (f p 1 - ps)f - pt. By the uniqueness in 
the division algorithm it follows that -pt = r . But r p 
equals one of r, through r n , and so if p > M, the only way 
all its coefficients can be divisible by p is if r = 0. That is, 
for all p > M, f | f p . 

Every one of the primitive nth roots of unity has the 
form «* for some k, 1 < k < n — 1, such that ( k , n) = 1. 
Of course if j = k (mod n), then - w*. Thus let us fix k 
as above and seek such a j for which we can prove that o' 
is a root of /. I et P denote the product of all primes less 
than or equal to M except those which divide k. We claim 
that j - k + nP is an appropriate j. (Obviously j = k (mod 
n).) Every prime factor of j must be greater than M, for 
any smaller prime will be a factor of exactly one of k and 
nP. Thus the prime factorization j = p x p 2 ' • • p m , possibly 
with repetitions, includes only primes greater than M. By 
our earlier result then, each p t is such that f\f pr Now we 
repeatedly apply this fact. Since w is a root of /, it is a root 
of f pp that is, <o* is a root of /. But then u p ' is also a root 
of f , which implies that (u p, y 2 = u p ' Pl is a root of /. 
Continuing in this fashion, we obtain that is a root of /, 
which completes the proof. 

To prove Theorem 24 we want to determine for what 
values of n is a power of 2. It follows easily from the 
definition of <f> that if j and k are relatively prime, 
then 4>{jk) = §(j)<p(k). Therefore, if n has the prime 
power decomposition n = p^'p ? 1 ' ' ' Pv*> then <j)(n ) 
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= YliPr\Pi ~ 0- Thus if Pi is an odd prime, we must 
have both m l = 1 and p, - 1 equal to a power of 2. This is 
exactly what we want. 

Section 3.1 

1. Show that the general cubic equation ax 3 + bx 2 + 
cx + d = 0 can be solved by radicals. (Hint: Without loss of 
generality, take a = 1. Make the successive substitutions 
x = y + L and y = z + K/ z for appropriate choices of L 
and K.) 

Without loss of generality we can take a = 1, for 
otherwise we could divide through by it. (Since it is a 
cubic, a # 0. If it were 0, we would have at most a 
quadratic, which is solvable.) The substitution x — y + L 
leads to the equation 

y 3 + J 2 [3L + 6] +/[3L 2 + 2 bL + c ] 

+ [L 2 + bL 2 + cL + d]= 0. 

The y 2 term drops out by taking L = -bf 3. So the 
equation is now in the form 

y 3 + By + C = 0. 

If C = 0, its solution is apparent. If C # 0, then y = 0 is 
not a root of this equation. The substitution y = z + K/ z 
leads to 

z 6 + z 4 [3/C+ B] + Cz 3 + z 2 [?>K+ B]K+ K 3 = 0. 
Two terms drop out by the choice K = - 5/3, leaving 


z 6 + Cz 3 + K 3 = 0, 
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or 

(z 3 f + C(z 3 ) + /l 3 = 0. 

This is a quadratic in which we can solve for z 3 . Thus z 
can also be obtained by radicals. From z it is easy to get 
back to x. 

(There remains a question of which square root and 
which cube root to take in the above in order to get a 
(non-extraneous) solution to the equation. This question 
goes beyond that of whether the equation is solvable by 
radicals, but in any case for a given equation the actual 
roots can be determined by trying all six combinations.) 

2. What is the degree of the splitting field of x 3 — 2 as 
an extension of Q? Find a basis for this extension. 

If to is a primitive cube root of unity, then the splitting 
field is given by E = Q(\^T, u ^2 , ui 2 ^!). We can build 
up to E by two simple extensions of Q. Let F, = Q(w), 
which is an extension of degree 2 since u is a root of 
(x 3 - l)/(x - 1) = x 2 + x + 1. A basis for this extension 
is { 1, «}. Since the irreducible (over Q) polynomial x 3 - 2 
cannot have a root in the quadratic extension F,, it is 
irreducible over F^ so that with F 2 = F,(^ ), a basis for 
F 2 over F, is { 1, ^2 , ^4 }. Clearly, F 2 = E, and as in the 
proof of Theorem 19, we may obtain a basis for E over Q 
by multiplying the elements of the bases of the two 
successive extensions to obtain {1, \/2”, ^4 , w, oyfl , 
«^4 }. Thus the degree of E over Q is 6. 

3. Find an irreducible polynomial over Q whose splitting 
field has degree less than n\ over Q, where n is the degree 
of the polynomial. ( Cf . Problem 8 of Section 2.2.) 

One such polynomial is x 4 - 2. If w is a primitive 4th 
root of unity, all its roots are given by { ^2 , w 1 ^2 , 
u 2 ^2, a 3 ^2}. The splitting field E can be built up in 
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two steps. Let F, = Q (i), which is of degree 2 but contains 
to (since u = ± /). Let F 2 = E. Then [E : Q] = [E :F,] 
[F, : Q] < 4 • 2 - 8 < 4! = 24. We note that [E : F,] < 4 
since any extension of F, with a single root of x 4 -2 
contains all its roots. 

As another example, consider the splitting field of 
x P~ l + x p ~ 2 + • • • + x + 1, which is irreducible and 
whose roots are the primitive />th roots of unity, where p is 
a prime. An extension by any root contains all the roots, 
and so the degree of the splitting field is just p— 1, 
precisely the degree of the polynomial. 

*4. Show that the general quartic equation ax 4 + bx 3 + 
cx 2 + dx + e = 0 may be solved by radicals. ( Hint : 
Without loss of generality, take a = 1. Let y - x + (6/4), 
and then try to write the resulting equation as a difference of 
squares, making use of some extra parameter.) 

Following the suggestions, we obtain for y an equation 
of the form y 4 + Cy 1 + Dy + E = 0. Adding and subtract- 
ing y 2 t + 1 2 / 4 on the left, where t is a parameter, we 
obtain ( y 2 + t/2) 2 ~[y 2 (t — C) — Dy + t 2 /4- E ] = 0. We 
can make the expression in the brackets a perfect square 
by choosing t so that its roots are equal, or equivalently, 
by making its discriminant (- D) 2 -4(t — CXf z /4- E) 
equal to 0. This latter gives a cubic equation for t: 

— t 3 + Ct 2 +4Et + D 2 - ACE = 0. 

By Problem 1, this can be solved in radicals. Let t 0 be one 
of the roots of this cubic. Then 


y 2 (t 0 — C) — Dy + to/4- E = (t 0 - C) 


D 


2 (t 0 ~C) 


as long as t 0 *C. But it is easy to verify that if all three 
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roots equal C, then C = D = E = 0, and thus y = 0 is the 
solution. Therefore we obtain by factoring our difference 
of squares 


/+r 0 /2 j ( , o ° c) )|. 


Setting each factor equal to 0, we can obtain / in radicals. 
The question of which roots to take in different parts of 
this solution can be analyzed, but we do not do it here. 
The above calculation shows completely that any x that is 
a solution can be expressed by radicals. (The cubic we 
encountered in t is called the resolvent cubic.) 

5. Show that there exists a number which has degree 4 
over Q but which is not constructible. (Hint: Consider the 
roots of f(x) — 4x 4 - 4x + 3.) 

We first divide the suggested polynomial by 4. From the 
calculations in the previous solution, we see that the 
resolvent cubic for / is - 1 3 + 3t + 1, which is irreducible. 
Thus none of its roots can be constructed. Howev er, in 
the notation of the previous problem, V/ 0 + C =v^ 
= y { +>'2 = + * 2 ’ where x, and x 2 are roots of the 

original polynomial. Similarly, = - jc 3 - x 4 , where x 3 
and jc 4 are the other roots of /. Thus if all of the roots of / 
were constructible, /„ would also be constructible. Since t 0 
is not constructible, neither is some root of /. Since / has 
no rational roots, if it were reducible it would be as a 
product of quadratics over Q, and thus all the roots would 
be constructible. Since they are not, / is irreducible, and 
so each root has degree 4 over Q. (It can be shown that 
none of the roots of / are constructible.) 
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Section 3.2 

1. Suppose <p is a mapping from a field F to itself such 
that <f> is one-to-one, onto, and preserves addition and 
multiplication (i.e., <p(a + b) = <p4a) + <t>(b) and 4fab) 
= <f>(a)4>(b) for all a, b E F). Show that tjf 0) = 0, <f>(l) = 1, 
and that <f> is an automorphism of F. 

Since </>(0) + <£(0) = <#>(0 + 0) = <#>(0) we conclude that 
<#>(0) = 0. Similarly, <|>( 1)^(1) = £(1 • 1) = <J>(1) so that 
^>(1) — 1, provided of course that <£{1) # 0, which is true 
since <f> is one-to-one and <j>( 0) = 0. To show that ef> is an 
automorphism, it remains only to show that it preserves 
subtraction and division. We have 0 = 4fb - b) — <j>(b) + 
<j>(— b) so that 4 >(-b) = ~ 4 >(b). Thus 4 fa - b) — 4fa) + 
4>( — b) — 4>(a) — 4>(b). Similarly, for b ^ 0, 1 =4fb/b) 
= <[>(b)4>(l / b) so that 4>(l/b) = l/4>(b). Thus 4fa/b) 
- 1/*) = <f>(a)/4>(b). 

2. Let 4> be an automorphism of a field F. Show that 
for every rational number q, 4f q) = q. ( That is, 4>, restricted 
to Q, which is a subfield of F, is the identity mapping.) 

From the fact that <#>(1) = 1 it follows from the additive 
property of 4> that 4>(n) = n for any positive integer n. For, 
4>(2) = 4fl) + 4fl) = 2 and the induction argument is 
obvious. For negative integers, the result follows from the 
fact that 4>(— n) — ~4fn), as shown in the previous 
problem. Thus, 4fm/n) = <j>(m)/4>(n) = m/n. 

3. Find all the automorphisms of Q(yj2 ). 

There is only one, the identity mapping. For, if 4> 
is an automorphism of Q(\^), 0 = <J>(0) = 4>[(]l2 ) 3 - 2] 
= [<K 3 V2)f - 4>(2) = [<K 3 V2)] 3 - 2, by Problem 2. Thus 
4>(t/2 ) is also a cube root of 2. But since Q(^2 ) c R, and 
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since there is only one real cube root of 2, ) = ^2 . 

If s is an arbitrary element of Q(\f2), we know we can 
write s = a + b(yj2 ) + c(y2 ) 2 , where a, b, and c are 
rational. But then <p(s) = s, so <j> = id. 

4. Find all the automorphisms of Q(V2 ). 

By the same reasoning as used in the last problem, 
<p(f2 ) is a square root of 2. But in this case there are two 
possibilities, ) = ± f2 . It is clear that each of these 
possibilities determines at most one automorphism, since 
Q(V2 ) = {a + b^2 | a, b £ Q}. If <f>(V2" ) = V2" , then 
<j>(a + bjl ) = a + bfl . This yields the identity auto- 
morphism. If <f>(ySr ) = -fl , then <j>(a + bfl ) = a - bfl . 
Is this an automorphism? We have to check the 
preservation of addition _and multiplication. For addition, 
<f>[(a + bf 2 ) + (c + dfl )] = <f>[(a + c) + (b + d)fl ] = 
(a + c) - (b + df2 ) = (a - b^2 ) + (c - dfl ) = <p(a + bfl ) 
+ <f>(c + cbjl ). Similarly, it is easy to verify that <j> 
preserves multiplication. Since <j> is also one-to-one and 
onto, it is an automorphism. Thus there are two 
automorphisms of Q(V2 ). 

5. Prove Theorem 29 using right cosets. ( If H is a 
subgroup of G, a right coset of H is a set { } where the 
xpj's range through H and </» is a fixed element of G. 

The proof may be repeated almost word for word, 
except that to show distinctness one needs to multiply 
through by inverses on the opposite side. 

6. Let G be any group, finite or not, and let H be a 
subgroup of G. If <j> €E G, then the set 4>H = {<j>f | \p G H) 
is called the left coset of H by <J>. Show that any two left 
cosets of G are either identical or disjoint. 
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Let <f> t H and <j> 2 H be two left cosets. Suppose they have 
some element in common, call it t. Then for some 
automorphisms e 7/, we have 4>,ip, = t = <f> 2 ^ 2 , and 
thus <(>! = (t> 2 \p 2 \[/{-' E<t> 2 H. But this implies that <}>,// 
C <#> 2 //, for if is an arbitrary element of <£,//, we have 

= V) e <£ 2 #- Similarly, <j> 2 H C$\H. Thus if 

<(>, H and <j> 2 H have a single element in common, then 
<h H ~ $2 H - 

7. If G is a group whose order is a prime p, how many 
subgroups does G have ? 

It has two subgroups, G and {id}. By Lagrange’s 
Theorem there can be no others. 

8. If G is a cyclic group of order n, determine exactly 
how many elements of G generate G. 

Let \p be one generator of G. Then we claim that \p k is 
also a generator of G if and only if ( k,n)= 1. For 
certainly, (\p k ) n = id, and the elements \p k , (\p k ) 2 , 
. . . ,(ip k ) n will be distinct if and only if n is the least such 
positive power. Since \p is a generator, (i p k ) m — id if and 
only if km =0(mod n ). If (k, n) = 1, then n \ m, and so 
m> n. Conversely, if k and n have a common factor j, 
then (> p k ) n/ J = id. The claim thus having been established, 
we simply observe that among G = {\p, \p 2 , . . . , = id) 

there are precisely <p(n) terms with exponents relatively 
prime to n, where here <p is the Euler ^-function. 

9. If G is a cyclic group of order n, and if m\n, show 
that G has a cyclic subgroup of order m. 

For any generator tj> of G, we have G = {<}>, <p 2 , 
. . . , <p n }. If n/m = k, define ^ = <p k . Then the cyclic 
subgroup generated by vf clearly has order m, since 
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i//, \p 2 , . . . , ip m are all distinct, being among the list 
</>, <(> 2 , . . . , 4>"\ and also, \p m = </>" = id. 

10. Show that every subgroup of a cyclic group is cyclic. 

If we write G = {<>, <#> 2 , <j> 3 , . . . , «£>"}, then H is of the 
form {</>', <t>f = id}, where, without loss of 

generality, we may assume that i <j < • ■ • < n. Then H 
is the cyclic subgroup generated by 9 '. To see this, 
observe that if <j> k £= H, we can write k = iq + r, 
0 < r < i, and compute <j> r = <j> k <p ~ iq E H. By the mini- 
mality of i, r - 0 and so k is a multiple of 1 . Therefore, 
<p k = (<J >, ) ? and thus belongs to the cyclic subgroup 
generated by <f>'. 

11. If \G\ = p, a prime, show that G is cyclic. 

Let <f> G G, <f> + id. Then the cyclic subgroup generated 
by <p must have order p by Lagrange’s Theorem. 

12. If H is a subgroup of prime index p in G, show that 
other than for H and G, there is no subgroup H such that 
H C H C G. 

By Lagrange’s Theorem, | H[ = k\H\ for some integer 
k> 1. But \G\ — p\H\ and \H\ divides |G|. Therefore 
k = 1 or p. 


Section 3.3 

1. If p is a prime, show that the Galois group of 
x p ~ l + x p ~ 2 + • • • + 1 = ( x p - \)/(x - 1) is cyclic and 
of order p — 1 . 

The splitting field of the polynomial is Q(w), where w is 
any primitive />th root of unity. By Theorem 31, the map 
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<f > : w->u k , for each k such that 1 < k < p - 1, deter- 
mines a unique element of the Galois group, whose order 
is therefore p - 1. (Recall from Problem 8 of Section 2.1 
that this polynomial is irreducible over Q.) If g is a 
primitive root modulo p, then the automorphism 
determined by the condition, <j> : w-*u g , is a generator of 
the Galois group. For we have, <j> 2 : <f> 3 : w-^co* 2 , 

etc., and the exponents will range through all congruence 
classes between 1 and p - 1. 

2. Determine the order of G( E/Q) where 
E = Q(V2\V^)- ( Hint : See Problem 7, Section 2.3, and 
Problems 1 and 2, Section 1 .4). 

By the problems cited, E = Q(V2~ + (or something 
similar, depending on one’s solutions), and fl + V3 has a 
minimal polynomial over Q, x A - 10x 2 + 1. However, E is 
actually the splitting field of this_ polynomial, whose 
complete set of roots is V2 ±VJ, ± JT . By Theorem 

31, |G(E/Q)| = 4. 

3. What is the order of the Galois group of x s — 1? 

It has the same splitting field, and hence the same 
group as, jc 4 + x 3 + x 2 + x + 1, which by Problem 1 has a 
Galois group of order 4. 

4. Is there some polynomial f E Q[x] such that 
Q(\/T ) is its splitting field ? 

No. If Q(^2 ) were the splitting field of /, then / would 
have to have an irreducible factor in Q[x] of degree greater 
than 1, and any root r of such a factor would satisfy 
Q(r) = Q(\/2 ) by a simple degree argument (Theorem 19). 
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If r x and r 2 were two such roots, they would generate a 
nontrivial element of <7(Q(v/2)/Q) by Theorem 31, con- 
trary to the result of Problem 3, Section 3.2. 


5. // <f> £ (7(E/F), where E is a finite extension of F, 
and if K is an intermediate field, show that the set 
<f>(K) = (<P(a) | a £ K} is an intermediate field. Show further 
that [K : F] = [<KK) : F]. 

1 G <KK), since <#>{1) = 1. <j>(a) + 4>(b) E < p(K), since 
<j>(a) + 4>(b) = <p(a + b). Similarly for the other operations. 
Now, <j> preserves linear independence over F, for 


2c,w,= 0 if and only if <f> 

i 


2 c,u, 


■ 2 <#(«/) = °> 


where the c,’s G F and the vectors u : e K. Thus, a maximal 
linearly independent set in K over F corresponds to a 
maximal linearly independent set in <KK) over F. 

6. Let F be a field. Numbers a t , a 2 a n are said 

to be algebraically independent over F if there is no 
nontrivial polynomial p(x v x 2 , . . . , x n ) over F such that 
p(a v a 2 , . . . , a n ) = 0. Show that if F is countable, then for 
any n there exist n algebraically independent elements over 
F. 


If F is countable, we have seen (Problem 4, Section 1.5) 
that the set of all numbers algebraic over F is countable. 
Since C is not countable, we can pick a number a, which 
is transcendental over F. It is easy to see that F(a,) is 
countable. By induction, then, we can construct a 
sequence a v a 2 , . . . , a n such that each a k is transcen- 
dental over F(«|, a 2 , . . . , To show that these 

elements are algebraically independent over F, suppose 
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there is a nontrivial relation p{a v a 2 , . . . , a n ) = 0. If k is 
the largest subscript appearing in a nonzero term, this 
implies that a k is algebraic over ¥(a { , a 2 , ■ ■ ■ , a k _{), a 
contradiction. 

7. Let a,, a 2 , . . . , a„ be algebraically independent 
oxter a field F. Define E = F(a,, a 2 , , a n ). Show that for 

every permutation of the numbers a v a 2 , . . . , a n there exists 
a unique <j> €5 G(E/F) which has the effect of this 
permutation on the a’s. 

Since any permutation can be obtained by a sequence 
of transpositions, we need only treat the case of a 
transposition. Furthermore, since the a's are algebraically 
independent over F, it is easy to see that they can be 
adjoined to F in any order to obtain E. Let us suppose 
that we are interested in the transposition a,<=>a*. Then 
we may write E = K(a„ af) = K(c, d), where c = a„ d = a,, 
and K is F extended by all the other a’s. An arbitrary 
element of K has the form p(c, d)/q(c, d), where p and q 
are polynomials over K. The obvious candidate for <j> is 
the map 

P( c ’ d ) P( d ’ c ) 

q(c,d) q(d,c) ’ 

To show it is well defined, suppose pfc, d)/qfc, d) 
= p 2 {c, d)/q 2 (c, d). By algebraic independence over K, 
which is apparent, the polynomial P\q 2 ~ p 2 q\ is identi- 
cally 0. Thus p l (d > c)/qfd,c) = p 2 (d,c)/q 2 (d,c). The 
same reasoning may be used to show that <j> is one-to-one. 
It is obvious that <f> is onto and that it leaves F fixed; in 
fact, this <j> leaves K fixed. Preservation of addition and 
multiplication is very simple, and so the details are 
omitted. 
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8. How many automorphisms of R are there ? 

There is only one, the identity map. For suppose <f> is an 
automorphism of R. If a < b then it must be that 
<f>(a) < <f>(b), because — <f>(a) = (j>(b - a) = 4>(fb — a )• 
<i>(fb - a ) > 0. Also we know that any automorphism «f> 
leaves each rational number fixed. If r is any real number, 
let a n and b n be sequences of rationals approaching r from 
below and above respectively. From a n < r < b n we have 
<f>K) < </>(r) < <f>(b„ ) and so a n < <H> ■) < K Letting n-*co, 
we conclude that (jf r) = r. 

*9. If the field of complex numbers C is an algebraic 
extension of a field E, show that any automorphism of E can 
be extended to an automorphism of C. 

This problem is more advanced than most; it is needed 
to solve Problem 10. They may both be skipped without 
loss of continuity. 

Let <j> denote the given automorphism of E. If K is any 
field, a mapping a : K->C which is one-to-one and 
preserves the field operations is called an isomorphism of 
K. (If a(K) = K itself, then o is also an automorphism of 
K.) Define a set 5 of ordered pairs (K, o ) such that K is a 
field containing E and o is an isomorphism of K whose 
restriction to E is just <j>. The set 5 can be partially 
ordered by the definition (K,, a,) < (K 2 , o 2 ) if and only if 
K, c K 2 and o 2 restricted to K, is just o t . Given two 
elements of S, they are not necessarily comparable under 
the relation “ < ”. A subset of S such that all elements are 
comparable is called a chain, and every chain of S has an 
upper bound. To see this, denote the chain by (K a , o a ) for 
an index a. An upper bound is given by (K, o) where 
K = U Q K a and a(a) = a„(a) for any K a containing a. A 
famous principle called Zorn’s Lemma asserts that if 5 is 
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a partially ordered set in which every chain has an upper 
bound, then S has at least one maximal element (that is, 
an element s such that if s > s, in fact s = s). In the 
present case, this gives a maximal element which we shall 
call (K, a). (For Zorn’s Lemma, see P. Halmos, Naive Set 
Theory. Zorn’s Lemma is equivalent to the Axiom of 
Choice.) 

We will now _see that K = C. If not, let r be a complex 
number not in K. Now r must be algebraic over K (since 
C is an algebraic_extension), and so it has a minimal 
polynomial over K. Under d, the coefficients of this 
polynomial are transformed so as to give another 
pojynomial g, which is easily seen to be irreducible over 
o(K). Let t be any root of g. The mapping r — ► t induces 
an isomorphism of K(r), the straightforward verification 
of which is omitted, and this is an extension j)f o. This 
construction contradicts the maximality of (K, 5), thus 
implying_that K = C. 

Since K = C, it only remains to show that o is onto, for 
then it must be an automorphism of C. But if t e C, it has 
a minimal polynomial g over E, the coefficients of which 
under <j> ~ 1 lead to another irreducible polynomial / over E 
of the same degree. Since a is one-to-one and must map 
the roots of / to roots of g, it follows that t must be the 
image of one of them. Thus 5 is onto, and this completes 
the solution. 

*10. How many automorphisms of C are there! 

There are an infinite number. To be more precise, let c 
be the cardinality of the continuum (R, for example), and 
as is customary let 2 C be the cardinality of the set of all 
subsets of the continuum. Then there are 2 C auto- 
morphisms of C. The proof depends partly on Problem 9 
and also makes independent use of Zorn’s Lemma, which 
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was introduced in the solution to that problem. This result 
is included because it is quite striking, but the reader who 
lacks the appropriate background may skip it without any 
loss of continuity. 

Let S be the family of all sets which are each 
algebraically independent over Q, meaning strictly that 
each finite subset of each set is algebraically independent, 
according to the definition in Problem 6. If S is ordered 
by set inclusion, then each chain has an upper bound 
given simply by the union of the sets in the chain. Zorn’s 
Lemma now implies that there is a maximal element G in 
S. G is called a transcendence base of C over Q. 
Remember that it is a set of numbers algebraically 
independent over Q. Let E be the smallest field containing 
all the elements of G. Clearly C is an algebraic extension 
of E, or else G would not be maximal. 

First we construct automorphisms of E and then we use 
Problem 9 to extend each to an automorphism of C. Let / 
be a one-to-one mapping of G onto itself. Then / 
generates an automorphism of E in the following way. 
Every element of E has the form p(a, ft, . . .)/ 
q(a, ft, . . .), where p and q are polynomials over Q in 
some finite number of variables and a, ft, , denote 
some elements of G\ that this is true follows since the set 
of all such expressions is itself a field and is a subset of 
any field containing all of G. The automorphism 
generated by / is the mapping. 

p{a,p,...) p{f(a),f( ft ),...) 

?(«./*>••■) " < 7 (/(«)./(/*).•••) ' 

It must be shown that <f> is well defined, one-to-one, onto, 
and that it preserves the field operations. This depends on 
the algebraic independence of G and is quite straightfor- 
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ward. The details are essentially identical with those in the 
solution to Problem 7. By extending <#> to all of C, as per 
Problem 9, we obtain from each / an automorphism of C. 

Thus we now need to know how many such functions / 
there are. First observe that G must have cardinality c, for 
it is not hard to see that E has the same cardinality as G 
and also that C has the same cardinality as E (from the 
fact that C is algebraic over E). And of course the 
cardinality of C is c. Now the set of one-to-one mappings 
from a set of cardinality c to itself is 2 C . To see this let us 
look at such mappings from R to R. Let A be one of the 2 C 
subsets of the interval (0, 1). Then B = A (- cc, 0] and 
its complement D in R each have cardinality c. Thus there 
exist one-to-one maps from (- oo, 0) to B and from [0, oo) 
to D, the combination of which gives a one-to-one onto 
mapping from R to R. Since different sets A give rise to 
different mappings, we get 2 C mappings in this way. Thus 
there are at least 2 C one-to-one onto mappings / from G to 
G, and thus at least 2 C automorphisms of C. However, 
there can be no higher number of automorphisms of C 
because the total number of functions from C to C (not 
even necessarily one-to-one or onto) is still just 2 C . This 
can be seen by counting equivalently the functions from R 
to R, each of which uniquely determines a subset of R 2 (or 
C) as its graph. So the number of such functions is no 
more than the number of subsets of C, which is 2 C . 

Section 3.4 

1. // F, = F(r,) and F 2 = F(r 2 ) and if F, and F 2 are 
conjugate fields over F, does it follow that r x and r 2 are 
conjugates over F? 

No. The number r, has a fixed finite number of 
conjugates. However F(r 2 ) = ¥(qr 2 ) for every q G Q. Since 
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there are an infinite number of these numbers qr 2 , they 
cannot all be conjugates of r,. Thus r 2 need not be a 
conjugate of r,. 

2. If E is a finite extension of F, show that there are 
only a finite number of intermediate fields K, E D K D F. 
( This already appeared as Problem 12 of Section 2.2, but an 
alternate and much simpler proof is now accessible.) 

First we need to extend E to a normal extension of F. If 
E = F(r), let E be the splitting field of the minimal 
polynomial of r over F. Then EdEdKdF. But there 
are only a finite number of intermediate fields between E 
and F, for by Corollary 32, these are in one-to-one 
correspondence with the subgroups of G(E/F) which is 
finite and hence has only a finite number of subsets, let 
alone subgroups. 

3. Show that conjugacy is an equivalence relation on the 
set of subgroups of a group. 

Let G be the group and let H x , H 2 , and // 3 be 
subgroups of G. H x — <#> “’//,<#> where £ = id G G; so the 
relation is reflexive. If //, = <J> ~ l H 2 <p, then H 2 = </>#,< f> _1 
where \p = $~ 1 GG; so the relation is 
symmetric. If H x =<j>~'H 2 ^> and H 2 = then 

H x = = t~ 1 H 3 t where t = ip<j> £ G; thus 

the relation is transitive. 

4. Show that conjugacy is an equivalence relation on the 
set of finite extensions of a field F. 

Let F,, F 2 , and F 3 be finite extensions of F. By the very 
definition, the relation is reflexive and symmetric. To 
show transitivity, the hypothesis is that F, = F(r,) and 
F 2 = F(r 2 ), with r, and r 2 conjugates; also F 2 = F(j 2 ) and 
F 3 = F(j 3 ), with s 2 and j 3 conjugates. Let / and g denote 
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the minimal polynomials (over F) of the r- s and the s', s, 
respectively, and then let E be the splitting field of the 
product fg. Since E is normal over F, Theorem 33 applies, 
thus enabling us to convert the question into one about 
groups. Since G(E/F,) and G(E/F 2 ) are conjugate and 
also since G(E/F 2 ) and G(E/F 3 ) are conjugate, by the 
previous problem G(E/F,) and G(E/F 3 ) are conjugate. 
By the theorem again, F, and F 3 are conjugate. (As an 
alternative, one could use Lemma 33.) 

5. If E is a normal extension of F and if K is an 
intermediate field, can every automorphism in G(K/F) be 
extended to an automorphism of E? 

Yes. If K = F(r), the given automorphism must map r 
to one of its conjugates r jy and it is uniquely determined 
by this rj. By the proof of Lemma 33, there is a 
</> G G(E/F) which does this. 

6. Let E be an extension of F, and rf> be an 
automorphism of F. Can <J> necessarily be extended to an 
automorphism of E: (a) in general ? (b) if E is a finite 
extension of F? (c) if E is a normal extension of F? 

The answer is negative in all three cases, as the follow- 
ing example shows. Let F = Q(V2 ) and E = F(V2”), where 
the roots refer to the unique positive roots. E is normal 
over F since it is the splitting field of x 2 -fl over F. (Of 
course E is not normal over Q.) An automorphism of F is 
determined by the condition ) = - V2 , since f2 and 

- \J2 are conjugates over Q. Letting 5 = \Jl , suppose $ 
could be extended to E. We would then have: f2 = s 2 , 

- V2 = <t>(s 2 ) = <p(s) ■ <!>(s) = [<£(.s)] 2 , a contradiction. 

7. Let r, and r 2 be conjugates over a field F and suppose 
that r 2 ^F(r 1 ). Prove or disprove: F(r,) n F(r 2 ) = F. 
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The assertion is not necessarily true. First let us 
construct the general form of a counterexample and then 
a particular case. Let F be a finite extension of Q and let 
K, be a nonnormal finite extension of F. If we write 
K, = F(i,) for an appropriate s,, then this s ] must have 
some conjugate s 2 over F such that s 2 ^K,. Otherwise K, 
would be the splitting field of the minimal polynomial of 
j, over F and hence would be normal over F. Now let E 
be the splitting field just mentioned, and define 
K 2 = F(s 2 ). Since K, and K 2 are conjugate over F, there 
exists a <j> E G( E/F) such that <KK,) = K 2 . If it happens to 
be the case that E is also a normal extension of Q, then 
since <f> E G( E/Q) as well, it follows that K, and K 2 are 
conjugate over Q. Thus we can write K, = Q(r,) and 
K 2 = Q(r 2 ) for some conjugates r x and r 2 . Nevertheless, 
contains all of F and hence more than just 
Q. 

The following particular example is of the type 
described. Let F = Q(V2 ), K, = F( 3 V2), and K 2 
= F(w yi ), where w is a primitive cube root of unity. 
Then E = ¥(\/2 , u) is normal over both F and Q since it 
is the splitting field of x 3 - 2 and (x 3 - 2)(x 2 - 2) over F 
and Q respectively. Thus the required conditions are 
satisfied. 

Section 3.5 

1. If E is a normal extension of F and K, and K 2 are 
intermediate fields, describe G( E/K, n K 2 ). 

It is the smallest subgroup of G(E/F) which contains 
both G(E/K,) and G(E/K 2 ). Calling this subgroup H, we 
note that H is simply the set of all products of arbitrarily 
many automorphisms, some from G(E/K,) and the others 
from G(E/K 2 ); for this set is a group and it must be 
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contained in any group containing both G(E/K,) and 
G(E/K 2 ). Thus, K, n K 2 is contained in the fixed field of 
H. On the other hand, if fl K 2 , it cannot be in the 
fixed field; for if a#. K,, say, then there exists a 
< j > 6 G(E/K,) c H such that <£(a) ^ a, and similarly if 
a#. K 2 . Since E is normal over K, fl K 2 , the fixed field of 
H, we have by Theorem 32 that H = G( E/K, fl K 2 ). 

2. In the context of Problem 1, show that if further K, 
and K 2 are normal extensions of F, then G( E/K, fl K 2 ) 
= W I <f> e G(E/K,), i G G(E/K 2 )}. 

The H discussed in the solution to Problem 1 must 
certainly contain the given set S = | <p G G(E/K,), f 

G G(E/K 2 )}. Also, S contains each of the groups 
G(E/K,) and G(E/K 2 ). Thus it suffices to show that S is 
itself a group. By the fundamental theorem on Galois 
theory, for every r £ G(E/F), r[G(E/K,) ] = [G(E/K,)]r, 
i = l,2. Thus, with the obvious notation, ~ 1 
= e S; = 

e an d> °f course, id = (id)(id) G 5. Thus 5 is 

a group. 

3. If E is a normal extension of F with intermediate 
fields K, and K 2 , and if K is the smallest field containing K, 
and K 2 , what is G(E/K)? 

G(E/K) = H l fl H 2 , where = G(E/K,) and H 2 
= G(E/K 2 ). For if <p G G(E/K) then $ leaves fixed all 
elements of K, which contains both K, and K 2 . Thus 
<t> G H x H H 2 . Conversely, if <j> G //, D H 2 , then it leaves 
fixed the set of all rational combinations of elements of K, 
and K 2 ; but this set is easily seen to be a field, and hence 
it is K itself. Thus $ E G( E/K). 

4. Let w be a primitive fifth root of unity. Find a field 
F such that [F(w) : F] = 2. 
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Recalling the solution to Problem 1 of Section 2.7, we 
may take F = Q(V5" ). 

5. Let to be a primitive 3 lit root of unity. Show that 
there exists a field F such that [F(to) : F] = 6. 

By Problem 1 of Section 3.3, G(Q(to)/Q) is cyclic of 
order 30. If <f> generates this group, the subgroup 
generated by <j> 5 has order 6. Letting F be its fixed field, 
by the normality of Q(to) over Q and hence over F, we 
have [Q(w) : F] = [F(«) : F] = |G(F(co)/F)| = 6. 

6. Find the order of the Galois group of x 4 — 2 and 
then describe completely, in some explicit way, each of its 
elements. 

If E is the splitting field of x 4 - 2, then [E : Q] = 8. 
For, E = Q(i, 4 V2 ) and [E : Q] = [E : Q( \j2 )] • 

[Q( 4 V2 ) : Q] - [degQ^/'HdegQ ] = [2][4] = 8. But then 
the group has order 8, since E is a normal extension 
of Q. As in the proof of Theorem 19, a basis for the 
extension is^ given by { 1 , \/2~ , ( \/2 ) 2 , ( 4 f2 ) 3 , i, i \jl , 
/'( V 2 ) 2 , i(y/2 ) 3 }. Thus an automorphism cj> E G( E/Q) is 
completely described by its action on i and ^2 . Since </> 
must preserve the roots of x 2 + 1 and x 4 - 2, we have 
<#>(/) = ± i and <p( 4 t/2 ) = ± \[2 or ± i 4 f2 . All eight 
combinations yield the elements of the group. We may 
call these <f>,, <j> 2 , . . . , <f> g and describe their behavior by 
the following table: 



<f>i 

$2 

<>3 

</>4 

^5 

<t>6 

fa 

$8 

i 

4 V2 

i 

i \/2" 
i 

i 

-/ 4 V 2 
/ 

4 V2 
— / 

'V2 

— i 

~ 4 V2 

— i 

-i 4 fl 
— i 


7. Find all the subgroups of the group obtained in the 
previous problem, and determine which ones are normal 
subgroups. 
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By computation, we find that <#»| = 4> 3 > <t > 2 ~ and 
</>2 = id = Also, <p 3 = id, <f> $ = <f> 2 <p 3 , <f >7 = ^^d 

<t> s = <#> 2 <f> 5 - Lastly, <j> 5 <p 2 = <f> 29 s- Since everything can be 
expressed in terms of <j> = <j> 2 and ip = <t> 5, we can rewrite 
the group as {id, <p, <p 2 , <j> 3 , \p, <p\p, <p 2 t p, and <p 3 ip), where 
<p 4 = \P 2 = id and ip<j> = <p 3 ip. 

Other than the entire group G and {id}, there are 
four normal subgroups: A, = {id, <p, <p 2 , <£ 3 }, N 2 = {id, 
<pip, 4> 2 , <p 3 xp), N 3 = {id, i p, <p 2 , (pfy}, and N 4 = {id, <p 2 }. 
There are four nonnormal subgroups: //, = {id, tp}, 
H 2 = {id, H 3 = {id, and H 4 = {id, <p 3 \p). (These 
answers are the results of a number of computations; but 
by Lagrange’s theorem, only subsets with 2 or 4 elements 
need be tried.) 

8. If E is a normal extension of K and if K is a normal 
extension of F, does it follow that E is a normal extension of 

F? 

No. Let E, K, and F be the fixed fields of H { , N 3 , and 
G, respectively, in the solution to the previous problem. 
By the normality of H ] in N 3 (as can be verified) and of 
N 3 in G, E is normal over K and K is normal over F. 
However, E is not normal over F because //, is not 
normal in G. 

9. If H is a normal subgroup of a group G and if N is 
a normal subgroup of H, must N be a normal subgroup of 
G? 

No, as shown in the solution to the previous problem. 

10. Why should it be obvious that the Galois group 
calculated in Problem 6 is solvable ? Verify its solvability 
directly. 

Since jc 4 — 2 is obviously solvable by radicals, its group 
must be solvable. The sequence G d N 3 d D {id} 
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fulfills the definition of solvability. Another such sequence 
is G D w, D N 4 d {id}. 

11. 7/F is a field containing no pth roots of the number 
A £ F, for a fixed prime p, show that the polynomial x p — A 
is irreducible over F. 

Suppose x p — A has a monic factor g(x) e F[x] with 
degree m, 1< m < p -1. If u is a primitive pth root of 
unity, all the roots of x p - A and hence g have the form 
u k a, where a is a particular pth root of A. Thus the 
constant term of g, g 0 , has the form ± u J a m for some J, so 
that u J a m e F. Since (m, p) =1, we can choose integers s 
and t by the Euclidean algorithm such that sm + tp = 1. 
But then a pth root of A in F would be given by 
( u J a m YA‘ , since (( <t J a m ) s A‘) p = (u p ) sJ (a p ) sm A ,p = A. 


Section 3.6 

1. Let u be a primitive fifth root of unity, and label the 
roots of x i - 1 as: r, = 1, r 2 = «, r 3 = w 2 , r 4 = u 3 , r 5 = w 4 . 
Give an example of a permutation of these roots which does 
not correspond to an element of the Galois group of the 
polynomial. 

The transposition (1 2) cannot be in the group, for, 
since 1 G Q, 1 must be unchanged by every element of the 
group. 

2. Label the roots of x* — 2 as: r, = +\jl ( positive 
real), r 2 = ifl, r 3 = - }/2 , r 4 = — / f2 . Give an example 
of a permutation of these roots which does not correspond to 
an element of the Galois group of the polynomial. 

The 3-cycle (13 4) cannot correspond to an element of 
the group, since if <f>(r,) = r 3 , then <j>(r 3 ) = <K~ r \) ~ ~ r s 
= r x * r 4 . 

3. Prove or disprove: If f E F[x] is irreducible, then its 
Galois group over F is transitive. 
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The result is true, and the proof given in the first 
paragraph of the proof of Lemma 35a may be used 
verbatim with p replaced by n and Q replaced by F. The 
hypothesis that p is prime was not used in that part of the 
proof. 

4. Find a fifth degree polynomial of the form 
x s + bx + c, such that it is not solvable by radicals. 

The polynomial fix) = x i - 5x + 5/2 is such a poly- 
nomial. It has a positive local maximum at x = — 1 and a 
negative local minimum at x = + 1, and it has no other 
stationary points. Thus it has one real root on each of the 
intervals (- oo, - 1), (-1, 1), and (1, + oo). By Lemmas 
35a and 35b, it is not solvable by radicals. 

5. Iff £ F[x] is irreducible and there is a root r of f and 
a sequence of radical extensions F = F 0 c F, c • • • C F N 
such that r £ F^, show that f is solvable by radicals over F. 
( That is, if one root of an irreducible polynomial can be 
obtained by radicals, all the roots can.) 


For each j, we are given that F /+ , = F fiaf), for some aj 
such that ap £ F y for some n y . Let us define A ] = ap and 
let the conjugates of Aj over F be Aj, Aj, . . . ,A"j. Then 
the polynomial gfix) = (x n > — /LX*"' - Aj) ■ • • (*”' — A'j) 
is readily seen to be over F, for its coefficients are 


symmetric polynomials evaluated at the conjugates of Aj. 
We prove by induction on j, 0 < j < N - 1, that there 


exists a field E y+1 normal over F, obtainable by a sequence 
of radical extensions from E y , and such that F y+1 cE y+1 . 
The normality of E N over F implies that E^ contains all 
the roots of /. For j = 0, let E t be the splitting field of 


x n ° - A 0 over F; this clearly suffices. For an arbitrary j, 


the normality of E y over F implies that A } and all its 
conjugates over F are in E y and thus can all be obtained by 
radical extensions. Extend E . to E y+1 by successively form- 
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ing the splitting fields of x n > - Aj, x”> — A',..., x n > - A". 
E y+1 is normal over F since it is the splitting field of 
hj(x) = n/_ 0 fc(x) over F, and it clearly satisfies the other 
required conditions. 

6. Suppose that the group of a polynomial f over a field 
F is S n . Show that f is irreducible over F. Use this result to 
show that if in addition n > 5, then no root of f may be 
expressed in radicals. 

If / were reducible over F, then two of its roots would 
belong to different factors, and so they would not be 
conjugates. Thus the group of / over F would not contain 
any automorphisms mapping one of these to the other. 
Therefore the group could not contain all the permuta- 
tions of the roots of /, and so it could not be S„. If in 
addition n> 5, then the polynomial is not solvable by 
radicals over F. By the previous problem, none of the 
roots may be expressed in radicals. 

7. Show that for every prime p, there exists a polynomial 
of degree p over Q which has S p for its Galois group. 

By Lemma 35a, it suffices to find an irreducible 
polynomial of degree p Over Q which has exactly two 
nonreal roots. For p — 2, x 2 + 1 suffices; for p - 3, x 3 — 2 
suffices. Thus we may assume p > 5. Our construction 
only depends on the fact that p is odd. 

Consider the function 

g(x) = (x 2 + m)(x - 2)(x - 4) • • • (x - 2(p - 2)), 

where m is an even positive integer to be determined later. 
When x takes on the values 1, 3, 5, .... 2p - 3, the/? - 1 
values of g(x) are integers greater than 2 in absolute value 
and alternating in sign. Thus the function f(x) = g(x) - 2 
will also alternate signs at these p - 1 points. By the 
intermediate value theorem, / has at least p - 2 roots 
between 1 and 2p — 3. 
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We saw in Problem 3 of Section 1.6 that the sum of the 
squares of the roots of a polynomial 2"-o a j xi is 

/ a n-\ \ 2 2a„_ 2 

l ’ 

which does not depend on a 0 . Thus this sum is the same 
for / as for g, since these polynomials only differ in the 
constant term. By considering g, we see that this sum is 

2 2 +4 2 + ••• +[2(/>— 2)] 2 — 2m, 

and so for m sufficiently large it is negative. Therefore at 
least one root of / is not real. But its complex conjugate 
must also be a root. Thus we know we have p — 2 real 
roots and 2 nonreal roots, and these account for all p 
roots. 

The resulting polynomial / is also irreducible, since it 
satisfies Eisenstein’s criterion with the prime 2. (Note that 
2 2 divides the constant term of g(x), so it cannot divide 
the constant term of f(x) = g(x) - 2.) 

8. For every n, show that there exists a field F and an 
nth degree polynomial f E F[x] such that the group of f over 
F is S„. (Hint: Review Problems 6 and 7 of Section 3.3.) 

Choose numbers a,, a 2 , . . . , a„ that are algebraically 
independent over Q. Let E = Q(a,, a 2 , . . . , a„). Let 
b, - ( - Ifafa,, a 2 , , a n ), where a, is the ith elementary 
symmetric function. Define K = Q(Z>,, b 2 , . . . , b„). Then 
we have Q c K c E. Each permutation of the a’s induces 
an automorphism of E (Problem 7, Section 3.3); and by 
the symmetry of the b,' s, such automorphisms leave K 
fixed. Thus |G(E/K)| > «!. However, E is the splitting 
field over K of /(jc) = 117— iC* — a i)- Therefore |G(E/K)| 
< n!. We conclude that |G(E/K)| = «!, and thus 
G(E/K) = S„. F = K satisfies the requirements of the 
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problem. (The factor (-1)' in the definition of the 6/s 
was not necessary; it was introduced for consistency with 
later work.) 

Section 3.7 

1. What is the group of an irreducible quadrat icl Use 
the definition of a solvable group to show that it is a 
solvable group. Then carry through the procedure used in the 
first part of the proof of Galois' Theorem to derive an actual 
formula for its solution, which formula should turn out to be 
the quadratic formula. 

The group is S 2 . For, if r x and r 2 are the roots of the 
quadratic, by Theorem 3 1 there is a unique <j> in the Galois 
group such that <j>( r ,) = r 2 , this is denoted (1 2). Thus, 
both (1 2) and of course the identity are in the group, and 
so the group is S 2 . S 2 is solvable since S 2 D {id} is a prime 
factor composition series, the single factor being 2. Now 
let us represent the quadratic as f(x) = ax 2 + bx + c. Our 
primitive square root of unity is just w = - 1. As in the 
proof of Galois’ Theorem, we define numbers a 0 and a,: 
a 0 = r, + r 2 , 
a, = r, - r 2 . 

We know that a 0 = - b/a ; and by the proof cited, we 
know that a 2 should be in the coefficient field. By 
calculation we find that a\ = r 2 - lr x r 2 + r\ = (r, + r 2 ) 2 
— 4r { r 2 = b 2 / a 2 - 4c / a. Consequently, r t — r 2 = 

±yb 2 /a 2 — 4c/a = (± sjb 2 - 4 ac )/ a. Combining this 
with r x + r 2 =-b/a, we obtain the usual quadratic 
formula. 

2. Repeat the previous problem for a cubic equation 
under the assumption that the group is S y Does your 
solution depend on the fact that the group is all of S 3 ? (C/. 
Problem 1 of Section 3.1). 
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Let the cubic be f(x) = ax 3 + bx 2 + cx + d with roots 
r„ r 2 , and r 3 . We have 

«o - r \ + r 2 + r 3 

a, = r, + ur 2 + u> 2 r 3 
a 2 = /•[ + w 2 r 2 + i or 3 

since u 4 = u (for <o is a primitive cube root of unity). Let 
us take w = (-1 + jVJ)/ 2. We now repeat the process 
used in the proof of Lemma 34b. In particular, we shall 
verify by direct calculation that al, a\, and a\ are in a 
radical extension of the coefficient field. For a\, this is 
obvious. An exhausting computation sho ws th at a\ = 
(— b/a ) 3 + 9 (be /a 2 - 7>d/a)/2 — (3j'/ 2)^3A 2 , where 
A 2 = —4 b 3 d /a 4 — lid 2 / a 2 + \&bcd/a 3 —4 c 3 /a 3 + b 2 c 2 /a 4 . 
(A 2 is called the discriminant of the cubic; it equals 
n, < j(r t ~ rj) 2 .) Similarly, a\ = {-b/a) 2 + 9 (be /a 2 - 
3d /a )/2 + (3i /2)v^3A 2 . Taking an arbitrary cube root in 
the first expression as the value of a 1 and computing a 2 
from the relation a x a 2 = b 2 /a 2 - 3 c/a, we finally are able 
to obtain: 

r { = (-b/a + a x + a 2 )/3 
r 2 = + u 2 a | + wa 2 )/ 3 

r 3 = (- fc/a + coa, + w 2 a 2 )/3. 

This solution does not depend on the assumption that 
the group is S 3 . 

3. Let f be a cubic polynomial with real coefficients 

and let F be the smallest field containing the coefficients. If 
r,, r 2 , and r 3 denote the roots of /, we define the discriminant 
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of f to be A 2 = n (< ^(r, — rf) 1 . Show that A 2 e F. Show 
further that if A 2 > 0 all the roots are real and if A 2 < 0 
there is exactly one real root. What if A 2 = 0? 

A 2 e F since it is a symmetric function evaluated at the 
roots. If all the roots are real, it is clear that A 2 > 0. Thus, 
if A 2 < 0, there must be a nonreal root. Whenever there is 
a nonreal root, its complex conjugate must also be a root 
(Problem 11, Section 2.2), and so there must be exactly 
two nonreal roots, and hence exactly one real root. Let us 
denote such nonreal roots by a ± bi, where a, b E R. 
Calling these roots r x and r 2 , and the real root r 3 , then 
r { - r 2 = 2 bi. The product (r, - r 3 )(r 2 - r 3 ) is easily seen 
to be real, so that A 2 = -4 b 2 [(r, - r 3 )(r 2 - r 3 )] 2 < 0. 
Thus, if A 2 > 0, the roots must be real. By the same 
argument, if A 2 = 0 the roots are real, and in this case at 
least two of them must be equal. 

4. If r is any root of the irreducible cubic f, over some 
field F, and if A is a square root of the discriminant, show 
that the splitting field E of f, over F, is given by F(A, r). 

Since A = ±(r, — r 2 )(r l — r 3 )(r 2 - r 3 ), where the r,’s are 
all the roots of /, it follows that F(A, r) c E. Let us take 
r, = r. Since (x - r,) divides / in F(A, r)[x], the quotient 
polynomial (jc - r 2 )(x - r 3 ) = x 2 — ( r 2 + r 3 )x + r 2 r 3 also 
has coefficients in the field F(A, r). Substituting x — r x , it 
follows that (r, - r 2 )(r, — r 3 ) E F(A, r) and so, from the 
expression for A, r 2 - r 3 E F(A, r). Since also r 2 + r 3 
E F(A, r), as it is the negative of the coefficient of a 
polynomial over this field, it follows that r 2 and r 3 both 
E F(A, r). Consequently F(A, r) = E. 

*5. Let f be an irreducible cubic over Q with three real 
roots. Show that it is not possible to solve for any of its roots 
by real radicals alone. ( This may be surprising.) 

We adopt the notation of the previous two solutions. 
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Suppose there were a sequence of real radical extensions, 
Q = F 0 c F, c • • • C ¥ n , such that F N contains a root of 
/. Then the additional real radical extension F^(A) would 
contain all the roots of /, by the result of the previous 
problem. As in the proof of Galois’ Theorem, we may, 
without loss of generality, assume that each radical is of 
prime order. Since A 2 e Q (Problem 3), it is possible to 
reorder the radical extensions above so that F, = Q(A). 
Let F m be the first field in the resulting list containing a 
root of /. By the previous problem, then, F M contains all 
the roots of /. If F jW = F lW _,(a) where a^Fj,., and 
a £ F m _ 1 (the nature of a radical extension of prime 
order), then since / has no roots in F*-! and hence is 
irreducible over F*-i (for if it were reducible, it would 
have a linear factor and hence a root), we must have 3| p 
and so 3 = p. The splitting field of / over F m -i must be all 
of F m , because the fact that [F a/ :F m _i] is prime precludes 
the existence of nontrivial intermediate fields. Therefore 
F m is normal over F u _ l . Since x 3 - a 3 has no roots in 
F*_i, not a by assumption and no others since they are 
not real, x 3 — a 3 is irreducible over F*-x. By definition of 
normality then, all the roots of x 3 — a 3 must be in F*. 
which contradicts the assumption that it is a real field. 

6. Use Galois' Theorem to show that every quartic is 
solvable by radicals. (C/. Problem 4, Section 3.1.) 

S 4 is solvable because a prime power composition series 
can be shown to be S 4 3> A 4 d H 1 d H 2 3 {id}. Here 
A 4 is the set of even permutations, H x = {id, (1 2) 
(3 4), (1 3X2 4), and (1 4)(2 3)}, and H 2 = {id, (1 2)(3 4)}. 
The respective indexes of each subgroup in the previous 
one are 2, 3, 2, 2. Now if H is a subgroup of a group G 
and if the index of H in G is 2, it is easy to see that H 
must be normal. For if g e G, then both gH and Hg are 
either both H or both the complement of H in G, 
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depending on whether g E H. Thus only the normality of 
H x in A 4 needs to be worked out, and this straightforward 
verification will be omitted. If H is any subgroup of S 4 , 
then it can also be verified that H d (H n A) d (H n H x ) 
D (H fi H 2 ) d {id} is a prime power composition series 
for H, where any repetitions are omitted. Thus any quartic 
has a solvable group, and is thus solvable by radicals. (It 
can be shown more generally that any subgroup of a 
solvable group is solvable.) 


Section 4.1 

1. The function e‘ is sometimes defined by the power 
series 2“-o t n /n\. Show that this series converges for all t. 
Then use multiplication of series to prove the law of 
exponents e a e h = e a+b . 

The series is absolutely convergent by the ratio test, 
since the ratio 

i»r'/(» + i)! 

i<r/»! — w/( " +1) 

approaches 0 as n-*oo, for every t. For the law of 
exponents, we have: 


-im-Tfir) 


n\ 


= y — y 

«=o n\ faj\{n-j)\ 
“2 (« + *)" 


a J b n 1 


n-0 


= e a+b . 
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2. On the basis of the fact that power series can be 
added term by term within their common radius of 
convergence, show that two series of the form f(t) 
= 2 £°_ _j a k t k and g(t) = 2 “_ ~jb k tk can also be added 
term by term. Here j > 0 and f and g both converge when 
0<\t\<R. 

Since t J f(t) and t J g(t) are both convergent power series 
for |r| < R, they can be added term by term. Thus 
t J [f«) + g(t)}=2^-j(a k + b k )t k+ J. Division by t> now 
gives the result. 

3. On the basis of the definition of series of the form 
2 r~o a k ,k ‘ n the case when the a k 's may not be real, justify 
the fact that two such series may be multiplied in the usual 
way within their common radius of convergence. 

Let 2“-o a k t k an d be two such series. We 

write the real and imaginary parts of the coefficients as 
a k = b k + ic k , Aj = Bj + iCj. Within the common radius of 
convergence of the original two series, the four real series 

OO 00 00 00 

2 b k t k , ^ c k t k , 2 Bjtf 2 CjtJ 

all converge. We want to show that the product series is 
convergent and that the coefficient of t n is '2, n k ^ a a k A n _ k . 
If we compute the product, we obtain 

00 00 1 f 00 00 

2 h k‘ k + i 2 c k t k 2 B j‘ J + ‘ 2 C j‘ J 

k = 0 k=0 \ 7 = 0 j—0 

OO 00 00 00 

= 2 b k t k 2 BjtJ - 2 c k t k 2 CjtJ 

k - 0 y = 0 A: = 0 y = 0 

00 00 00 00 

+ i 2 c k t k 2 BjtJ + 2 b k t k 2 CjtJ 

k - 0 j- 0 1 k—0 j = 0 
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which is convergent by the rules for real series. 
Straightforward computation yields the expected 
coefficients. 


4. Prove Taylor’s Theorem for the case of a 
polynomial : If f is a polynomial of degree n in y, then 

f(2)( y \ 

f(y) = f(yo) + f (l \yo)(y - y 0 ) + 2 \ ' ^ - ^°) 2 


+ f (n \yo) , 

+ : — ( y~y 0 ) ■ 


n\ 


Here y 0 is an arbitrary fixed value of y. 


For n = 0, / is a constant and the result is obvious. If 
the statement is true for all polynomials of degree < n, 
then it is true for f’(y). Thus we may write 

f\y) ■ f(y o) + - n,) 


+ P\?o) , 


n - I 


Integration of both sides from y 0 to y now yields the 
result. 


5. Prove the following version of Gauss’ Lemma: If 
f(t, x) is a polynomial in two variables over F and if f can 
be written as a product f(t, x) = r(t, x)s(t, x), where r and 
s are polynomials in x whose coefficients are rational 
functions of t over F, then in fact f may be written 
f(t, x) = r(t, x)s(t, x) where r and s are polynomials over F 
in two variables. Furthermore, r and r have the same degree 
in x, as do s and s. 

The proof of Lemma 12 applies almost verbatim, except 
that “± 1” should be replaced by “constant polynomial” 
and “prime” by “irreducible polynomial in t”. 
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6. Prove that the function fl — t is analytic at 0. 

We seek a power series '2 l t =0 b k t k whose square is 1 — r. 
A necessary condition on the b k 's is obtained by squaring 
the proposed series and comparing coefficients with 1 - t. 
In particular, 



k 


2 bjb k _j 

j - o 


t k = 1 - t, 


so that bl = 1, 2b 0 b { = - 1, and 2 b 0 b k = - I'^A -y 
Clearly b 0 = 1 for the positive square root, hence 6, = 
-1/2 and we have a recursion relation determining the 
b k . If we can show that the resulting series has a positive 
radius of convergence, then we can conclude that it is a 
solution. By induction we can actually obtain a closed 
form expression for each b k , k > 3, namely 


{2k - 3)! 3 • 5 • • • • (2 k- 3) 

>k ~ 2 2k ~ 2 k\ (k - 2)! ” 2 k k(k - 1)! 


from which we conclude that |Z>*| < 1/4 k. The compar- 
ison 


2 IV*I< 2 4tM‘< 2 M* 

shows that our series certainly converges for |/| < 1. The 
addition of the first three terms cannot affect conver- 
gence. 

An alternative approach is to apply Taylor’s Theorem 
from calculus to f{t)=f \ - t . This easily yields the 
coefficients b k from the formula b k = f (k \0)/ kl, and thus 
we get a convergent series. The remainder term must be 
analyzed to show that the series converges to /(/). 
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*7. Let f(t ) be analytic at 0 and suppose /( 0) > 0. Show 
that f has an analytic square root at 0; that is, show that 
there is a function g(t) defined in some neighborhood of 0 
such that g(t) is analytic at 0 and [ g{t)f = /(/). 

If g(t) is one such function, then of course — g(t) is 
another. Let us seek the positive square root, so that 
2(0) > 0. Define x = g(t); we want to solve for x, 

x 2 = a 0 + af + a 2 t 2 + • • • , 

in the form of a power series in t. By changing variables 
we can transform this problem so as to make it amenable 
to the argument used in the proof of Lemma 36b. Without 
loss of generality we may take a 0 = 1 . The variable change 
y — 1 — x then results in an equation 

OO 

° = c d-y + c o y 2 + 2 c k‘ k ’ 

k = 2 

where c 0 = 1/2 and c k = -a k / 2 for k > 1. The exact 
method of the proof of Lemma 36b yields formulas for 
the coefficients in the proposed expansion y = 2”_, b k t k . 
To show convergence, a slight modification is necessary 
because there are an infinite number of the c,’s and we do 
not know they are bounded. But the convergence of 
obviously implies that of 2"_ 0 c fc f*, sa y 
absolutely on the set |/| < r. Thus as k -> oo, c k r k -> 0, and 
so there is a constant A such that for all k, | c^l < Ar~ k . 
As in the proof of the lemma, replacement of each c k by 
this bound yields a series for y, / which 

dominates the original. But this series is the solution to the 
problem 


0.A(i)- y + Ay^Al(L)\ 
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which is equivalent to 

0 - Af - y + 

for |t| < r. From the quadratic formula, we see that y may 
be expressed as a power series in t/ r, for |/| sufficiently 
small, and hence as a power series in t. Thus 
g(t) = x = 1 — y may also be so expressed. 

8. Consider the algebraic curve C defined by f(t, x) 
= 0. A real point of C is a point where both t and x are 
real. For example, the real points of t 2 — x 2 = 0 form two 
lines in the real t, x-plane. The curve t 2 + x 2 = 0 has only 
(0, 0) for a real point. A real point is isolated if it has some 
neighborhood containing no other real points, as in the 
second example. Show that if f(t, x) has real coefficients, 
then for every regular value of t, any corresponding real 
point cannot be isolated. 

In this case the coefficients of the power series for the 
root function, as derived in the proof of Lemma 36b, are 
all real. Thus the function is real and the points (/, *(/)) 
represent a real curve passing through the point in 
question. 

9. Suppose that y = g(t) is a polynomial of degree m 
over C. If there exist m + 1 values /, E Q at which 
g(f) E Q, show that the coefficients of g(t) must actually be 
in Q. 

The coefficients of g(t ) may be determined by a system 
of m + 1 equations as in the proof of Lemma 36c. The 
solution is obtained by rational operations on the values 
of f, and g(t,). Thus in this case the coefficients are 
rational. 
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Section 4.2 

1. Verify that Kronecker's specialization gives a 
one-to-one correspondence between the sets P d and K d . 

There is a one-to-one correspondence between the 
ordered n-tuples (/,, i 2 , . . . , /„), each ij between 0 and 
d- 1 inclusive, and the integers from 0 to d n — 1. The 
n-tuple is simply the base d representation of the integer. 
From this we deduce a one-to-one correspondence 
between the monomials XI"_ { uf and y‘i +, ^ + " ' +l * d " ', 
and the extension to polynomials is immediate. 

2. If g, h, and gh are all in P d > show that ( gh ) = gh. 

Let h = a(M 0 )Il"- x uf and let g = b(u 0 )u k for some k, 
1 < k < n. By the definition of g and h, it is clear that the 
result holds in this case. By repeated application of this 
fact, the result holds for any monomials g and h. For any 
finite sum Vm k of such monomials, it is clear that 
^m k Thus if we write g = '2 d m k and h='£ l n J , 

where the m k s and nfs are monomials, we obtain 

( 7 h ) = = ( This 

argument holds both when u 0 is a variable and when it is a 
constant.) 

3. In the example immediately preceding Lemma 36 c, 
verify that the factorization f(y) — y\ 1 + y ) leads back to 
a product gh which is not in P d . 

Here recall that we are using d = 3. Therefore from 
g(y) =y 2 we get g(u x , u 2 ) = uf, also from h(y) = 1 +/ 
= 1 +yy 3 , we get h(u t , u 2 ) — 1 + n,n 2 . The product 
gh = u x + u x u 2 is not in P d because the exponent of u, in 
the second term is not less than d = 3. 

4. This problem refers to the proof of Theorem 36. 
Define gj(u 0 , «„ . . . , u„) by the requirement that g,(u 0 ,y) 
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= G,(m 0 , /). Does it follow that q(u 0 , u n ) 

=n,6s&("o» «.)? 

Certainly not. This is essentially the point of the 
previous problem as well as the example in the text. In 
particular, ILes£,( u o> M i> • • • - «„) may not be in P d . 

5. Let f(t, x ) be an irreducible polynomial in two 
variables over Q, and let 3C be the set of all rational 
numbers a such that f(a, x) is irreducible as a polynomial 
over Q in x. % is called a basic Hilbert set. Lemma 36 d 
shows that % is infinite. Prove that 3C is dense in the real 
line R. (A set S C R is said to be dense in R if for every 
r E R there exists a sequence of numbers s n E S converging 
to r.) 

The solution uses the properties of the changes of 
variables discussed at the beginning of the section, and so 
let us adopt the notation there. Let r be a fixed real 
number. For each n we want to define a number s n E%, 
such that these s n ’s converge to r. 

Let n be fixed and choose t 0 to be a rational regular 
value of t such that |/ 0 — r| < 1/2 n. The corresponding 
polynomial g(t, x) has precisely the properties to which 
the proof of Lemma 36d applies, and so there exist 
arbitrarily large rational values t ] such that g(r,, x) is 
irreducible. If we pick any such t x >2n, then f(t 0 + 
(1 /t x ),x) is irreducible. The number s„ = /„ + (1/f,) is 
within distance 1/2 n of t 0 , hence within distance 
< (1 /2m) + (\/2n) = \/ n of r, and it belongs to %. The 
resulting sequence of .y„’s is therefore of the required type. 

Section 4.3 

1. In the context of Lemma 37b, are there any 
automorphisms of E leaving F fixed other than those induced 
by the n\ permutations of the a ’si 
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Yes, there are an infinite number of such auto- 
morphisms. Let r be any rational number other than 0 or 
1, and define 5, = ra v It is clear that a„ a 2 , ■ . ■ , a n are 
algebraically independent over F and that E may also be 
written E = F(u,, a 2 , ■ ■ . , a n ). By Lemma 37b, any 
permutation of 5,, a 2 , . . . , a n yields an automorphism of 
E leaving F fixed. Furthermore, all permutations which 
are not the identity on a { are different from those arising 
from permutations of a t , a 2 , . . . , a n . Repeating for 
different values of r, we obtain an infinite number of such 
automorphisms. Of course, none of these automorphisms 
can leave K fixed, for the original automorphisms already 
provided the maximum number, «!, of elements of 
G(E/K). 

2. Are the elements b v b 2 , . . . , />„, as defined in the 
text, algebraically independent over Q? 

Yes. If we had a nontrivial polynomial p(t lt t 2 , , t n ) 

over Q such that p(b „ b 2 , ... , b„) = 0, then substitution 
for each t, in terms of the sfs, as defined in the text, would 
yield a nontrivial polynomial in the sfs, say q(s x , s 2 , 
. . . , s n ). But then we would have q(a v a 2 , . . . , a n ) 
= p{b x , b 2 , ... ,b n ) - 0, contradicting the algebraic inde- 
pendence of the a,’s. 

3. Let H{t v t 2 , . . . , t n , x) be a polynomial over Q in 
n + 1 variables. Let b { , b 2 , . . . , b n be n complex numbers 
which are algebraically independent over Q. Define a 
polynomial h(x) = H(b v b 2 , . . . , b n , x) and consider it 
over K = Q(6„ b 2 , ... , b n ). What is the relation between 
the irreducibility of H and the irreducibility of hi 

If h is irreducible, it does not follow that H is also 
irreducible, as shown by the example H(t, x) = ( t 2 + 1) 
(x 2 + 1) and any choice of a real transcendental number 
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ft, such as n. However, if we add the hypothesis that the 
coefficient of the highest power of x in H is unity, then we 
can conclude that H is irreducible. This follows 
immediately from the proof of Lemma 37f. 

On the other hand, if H is irreducible, then h is also 
irreducible. For if h is reducible, we know we can write 

H*) = [p k x k + • • * + />o] [?„*" + ■••+%] 

where the p ’s and q ’s are rational functions of n variables 
over Q, evaluated at the ft/s. By an argument analogous to 
that used to develop Gauss’ Lemma, we may assume that 
the p’s and q’s are actually polynomials over Q. The 
coefficient of x i in h(x) is a polynomial d over Q evaluated 
at the ft/s. Thus we have 

d(b v ft 2 , . . . , ft„) 

j 

* 2 Pi(K ft„H-,(ft„ ft* , b n ). 

i-O 

Since the ft/s are algebraically independent, the poly- 
nomial d - '2k+ m -jPk ( Jm must be identically 0. Since this 
holds for each j, we obtain the factorization 

4. Show that none of the roots of the polynomial 
determined in the proof of Corollary 31b can be constructed. 

The polynomial is irreducible. By the solution to 
Problem 5 of Section 3.6, if one root could be obtained by 
a sequence of quadratic extensions of Q, then every root 
could be so obtained. Therefore, if one root were 
constructible, all the roots would be constructible. 
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